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Preface 


This book is based on lecture notes which I have used over a number of years 
to teach a course on mathematical methods to senior undergraduate students 
of mathematics at King Saud University. The course is offered here as a prereq- 
uisite for taking partial differential equations in the final (fourth) year of the 
undergraduate program. It was initially designed to cover three main topics: 
special functions, Fourier series and integrals, and a brief sketch of the Sturm- 
Liouville problem and its solutions. Using separation of variables to solve a 
boundary-value problem for a second-order partial differential equation often 
leads to a Sturm-—Liouville eigenvalue problem, and the solution set is likely to 
be a sequence of special functions, hence the relevance of these topics. Typi- 
cally, the solution of the partial differential equation can then be represented 
(pointwise) by a Fourier series or a Fourier integral, depending on whether the 
domain is finite or infinite. 


But it soon became clear that these “mathematical methods” could be de- 
veloped into a more coherent and substantial course by presenting them within 
the more general Sturm—Liouville theory in £?. According to this theory, a 
linear second-order differential operator which is self-adjoint has an orthogonal 
sequence of eigenfunctions that spans CL”. This immediately leads to the funda- 
mental theorem of Fourier series in £? as a special case in which the operator is 
simply d?/dx?. The other orthogonal functions of mathematical physics, such 
as the Legendre and Hermite polynomials or the Bessel functions, are similarly 
generated as eigenfunctions of particular differential operators. The result is a 
generalized version of the classical theory of Fourier series, which ties up the 
topics of the course mentioned above and provides a common theme for the 
book. 
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In Chapter 1 the stage is set by defining the inner product space of square 
integrable functions £?, and the basic analytical tools needed in the chapters 
to follow. These include the convergence properties of sequences and series of 
functions and the important notion of completeness of £7, which is defined 
through Cauchy sequences. 


The difficulty with building Fourier analysis on the Sturm—Liouville the- 
ory is that the latter is deeply rooted in functional analysis, in particular the 
spectral theory of compact operators, which is beyond the scope of an under- 
graduate treatment such as this. We need a simpler proof of the existence and 
completeness of the eigenfunctions. In the case of the regular Sturm-—Liouville 
problem, this is achieved in Chapter 2 by invoking the existence theorem 
for linear differential equations to construct Green’s function for the Sturm- 
Liouville operator, and then using the Ascoli-Arzela theorem to arrive at the 
desired conclusions. This is covered in Sections 2.4.1 and 2.4.2 along the lines 
of Coddington and Levinson in [6]. 


Chapters 3 through 5 present special applications of the Sturm—Liouville 
theory. Chapter 3, which is on Fourier series, provides the prime example of a 
regular Sturm—Liouville problem. In this chapter the pointwise theory of Fourier 
series is also covered, and the classical theorem (Theorem 3.9) in this context 
is proved. The advantage of the £? theory is already evident from the simple 
statement of Theorem 3.2, that a function can be represented by a Fourier 
series if and only if it lies in £?, as compared to the statement of Theorem 3.9. 


In Chapters 4 and 5 we discuss some of the more important examples of 
a singular Sturm—Liouville problem. These lead to the orthogonal polynomials 
and Bessel functions which are familiar to students of science and engineer- 
ing. Each chapter concludes with applications to some well-known equations 
of mathematical physics, including Laplace’s equation, the heat equation, and 
the wave equation. 


Chapters 6 and 7 on the Fourier and Laplace transformations are not really 
part of the Sturm—Liouville theory, but are included here as extensions of the 
Fourier series method for representing functions. These have important appli- 
cations in heat transfer and signal transmission. They also allow us to solve 
nonhomogeneous differential equations, a subject which is not discussed in the 
previous chapters where the emphasis is mainly on the eigenfunctions. 

The reader is assumed to be familiar with the convergence properties of 
sequences and series of functions, which are usually presented in advanced cal- 
culus, and with elementary ordinary differential equations. In addition, we have 
used some standard results of real analysis, such as the density of continuous 
functions in £? and the Ascoli—Arzela theorem. These are used to prove the exis- 
tence of eigenfunctions for the Sturm—Liouville operator in Chapter 2, and they 
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have the advantage of avoiding any need for Lebesgue measure and integration. 
It is for that reason that smoothness conditions are imposed on the coefficients 
of the Sturm-Liouville operator, for otherwise integrability conditions would 
have sufficed. The only exception is the dominated convergence theorem, which 
is invoked in Chapter 6 to establish the continuity of the Fourier transform. 
This is a marginal result which lies outside the context of the Sturm-Liouville 
theory and could have been handled differently, but the temptation to use that 
powerful theorem as a shortcut was irresistible. 


This book follows a strict mathematical style of presentation, but the sub- 
ject is important for students of science and engineering. In these disciplines, 
Fourier analysis and special functions are used quite extensively for solving 
linear differential equations, but it is only through the Sturm-Liouville theory 
in £? that one discovers the underlying principles which clarify why the proce- 
dure works. The theoretical treatment in Chapter 2 need not hinder students 
outside mathematics who may have some difficulty with the analysis. Proof of 
the existence and completeness of the eigenfunctions (Sections 2.4.1 and 2.4.2) 
may be skipped by those who are mainly interested in the results of the theory. 
But the operator-theoretic approach to differential equations in Hilbert space 
has proved extremely convenient and fruitful in quantum mechanics, where it 
is introduced at the undergraduate level, and it should not be avoided where 
it seems to brings clarity and coherence in other disciplines. 


I have occasionally used the symbols = (for “implies”) and = (for “if and 
only if’) to connect mathematical statements. This is done mainly for the 
sake of typographical convenience and economy of expression, especially where 
displayed relations are involved. 


A first draft of this book was written in the summer of 2005 while I was 
on vacation in Lebanon. I should like to thank the librarian of the American 
University of Beirut for allowing me to use the facilities of their library during 
my stay there. A number of colleagues in our department were kind enough 
to check the manuscript for errors and misprints, and to comment on parts of 
it. I am grateful to them all. Professor Saleh Elsanousi prepared the figures 
for the book, and my former student Mohammed Balfageh helped me to set 
up the software used in the SUMS Springer series. I would not have been able 
to complete these tasks without their help. Finally, I wish to express my deep 
appreciation to Karen Borthwick at Springer-Verlag for her gracious handling 
of all the communications leading to publication. 


M.A. Al-Gwaiz 
Riyadh, March 2007 
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Inner Product Space 


An inner product space is the natural generalization of the Euclidean space 
R”, with its well-known topological and geometric properties. It constitutes 
the framework, or setting, for much of our work in this book, as it provides the 
appropriate mathematical structure that we need. 


1.1 Vector Space 


We use the symbol F to denote either the real number field R or the complex 
number field C. 


Definition 1.1 


A linear vector space, or simply a vector space, over F is a set X on which two 


operations, addition 
+:X xX X, 


and scalar multiplication 
-:Fx xX >X, 


are defined such that: 
1. X is a commutative group under addition; that is, 


(a) x+y =y+x forall x,y EX. 


(b) x+(y+z) =(x+y)+z forall x,y,z € X. 
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(c) There is a zero, or null, element O € X such that x + 0 =x for all 
x EX. 


(d) For each x € X there is an additive inverse —x EX such that 
x+(—x) =0. 


2. Scalar multiplication between the elements of F and X satisfies 
(a) a-(b-x) =(ab)-x for all a,b € F and all x €X, 
(b) 1-x=x forall x €X. 
3. The two distributive properties 
(a) a- (x+y) =a-xta-y 
(b) (a+b) -x =a-x+b-x 
hold for any a,b € F and x,y €X. 


X is called a real vector space or a complex vector space depending on 
whether F = R or F = C. The elements of X are called vectors and those 
of F scalars. 


From these properties it can be shown that the zero vector O is unique, 
and that every x € X has a unique inverse —x. Furthermore, it follows that 
0-x = 0 and (—1) -x = —x for every x € X, and that a- 0 = 0 for every a € F. 
As usual, we often drop the multiplication dot in a - x and write ax. 


Example 1.2 
(i) The set of n-tuples of real numbers 
R” = {(£1,..., £n) : xi E R}, 
under addition, defined by 
(£1,...,£n) + (Y1, --,Yn) = (z1 + Y1,- --, Ln + Yn), 
and scalar multiplication, defined by 
a: (£1,..., 2n) = (A%1,..., a0), 
where a € R, is a real vector space. 


(ii) The set of n-tuples of complex numbers 


C” = {(21,.--,2n) : zi E C}, 
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on the other hand, under the operations 


(21,---;2n) + (wi,---,Wn) = (21 + W1,.--,2n + Wn), 


is a complex vector space. 
(iii) The set C” over the field R is a real vector space. 


(iv) Let I be a real interval which may be closed, open, half-open, finite, or 
infinite. P(I) denotes the set of polynomials on I with real (complex) coeffi- 
cients. This becomes a real (complex) vector space under the usual operation 
of addition of polynomials, and scalar multiplication 


b- (dna” + +++ +a," + ao) = bant” +--+ + baiz + bao, 
where b is a real (complex) number. We also abbreviate P(R) as P. 
(v) The set of real (complex) continuous functions on the real interval I, which 
is denoted C(I), is a real (complex) vector space under the usual operations 
of addition of functions and multiplication of a function by a real (complex) 
number. 
Let {x1,...,Xn} be any finite set of vectors in a vector space X. The sum 


n 
ayxy Ke? +H anXn = >) ax, a; EF, 
i=l 


is called a linear combination of the vectors in the set, and the scalars a; are 
the coefficients in the linear combination. 


Definition 1.3 


(i) A finite set of vectors {x1,...,X,} is said to be linearly independent if 


aixi =O>a;,=0 for allie {1,...,n}, 


M: 


iSl 


that is, if every linear combination of the vectors is not equal to zero except 


when all the coefficients are zeros. The set {X1,..., Xn} is linearly dependent 
if it is not linearly independent, that is, if there is a collection of coefficients 
@1,---,@n, not all zeros, such that 57>, a;x; = 0. 


(ii) An infinite set of vectors {x,,x2,x3,...} is linearly independent if every 
finite subset of the set is linearly independent. It is linearly dependent if it is 
not linearly independent, that is, if there is a finite subset of {x,,x2,xX3,...} 
which is linearly dependent. 
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It should be noted at this point that a finite set of vectors is linearly depen- 
dent if, and only if, one of the vectors can be represented as a linear combination 
of the others (see Exercise 1.3). 


Definition 1.4 


Let X be a vector space. 

(i) A set A of vectors in X is said to span X if every vector in X can be 
expressed as a linear combination of elements of A. If, in addition, the vectors 
in A are linearly independent, then A is called a basis of X. 

(ii) A subset Y of X is called a subspace of X if every linear combination of 
vectors in Y lies in Y. This is equivalent to saying that Y is a vector space in 
its own right (over the same scalar field as X). 


If X has a finite basis then any other basis of X is also finite, and both 
bases have the same number of elements (Exercise 1.4). This number is called 
the dimension of X and is denoted dim X. If the basis is infinite, we take 
dim X = oo. 

In Example 1.2, the vectors 


e1 = (1,0,...,0), 
e2 = (0,1,0,...,0), 


e, = (0,...,0,1) 
form a basis for R” over R and C” over C. The vectors 


dı = (i,0,...,0), 
dy = (0,i,0,...,0), 


dn = (0,...,0,4), 


together with e),...,e,, form a basis of C” over R. On the other hand, the 
powers of x € R, 


2 23 
Ls AsO ee he hc 


span P and, being linearly independent (Exercise 1.5), they form a basis for 
the space of real (complex) polynomials over R (C). Thus both real R” and 
complex C” have dimension n, whereas real C” has dimension 2n. The space 
of polynomials, on the other hand, has infinite dimension. So does the space of 
continuous functions C(I), as it includes all the polynomials on I (Exercise 1.6). 
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Let Pa(I) be the vector space of polynomials on the interval I of degree 
< n. This is clearly a subspace of P(I) of dimension n + 1. Similarly, if we 
denote the set of (real or complex) functions on I whose first derivatives are 
continuous by C1(/), then, under the usual operations of addition of functions 
and multiplication by scalars, C1(I) is a vector subspace of C(I) over the 
same (real or complex) field. As usual, when J is closed at one (or both) of 
its endpoints, the derivative at that endpoint is the one-sided derivative. More 
generally, by defining 


C”(I) ={f € CU): f™ € CU), ne N}, 
c>) = f D, 
we obtain a sequence of vector spaces 
C(I) > CHI) > C?(I) > -> C(I) 


such that C*(T) is a (proper) vector subspace of C™(I) whenever k > m. Here 
N is the set of natural numbers {1,2,3,...} and No = NU {0}. The set of 
integers {...,—2,—1,0,1,2,...} is denoted Z. If we identify C°(I) with C(I), 
all the spaces C"(I), n € No, have infinite dimensions as each includes the 
polynomials P(I). When J = R, or when J is not relevant, we simply write C”. 


EXERCISES 


1.1 Use the properties of the vector space X over F to prove the foll- 
owing. 


(a) 0-x=0 for all x EX. 


(b) a-0=0 forallacF. 
(c) (-1)-x = -x for all x €X. 
(d) If a-x =O then either a = 0 or x = 0. 


1.2 Determine which of the following sets is a vector space under the 
usual operations of addition and scalar multiplication, and whether 
it is a real or a complex vector space. 


(a) Pn(Z) with complex coefficients over C 

(b) P(T) with imaginary coefficients over R 
(c) The set of real numbers over C 
( 


d) The set of complex functions of class C"(I) over R 
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1.3 Prove that the vectors x,,...,x,, are linearly dependent if, and only 
if, there is an integer k € {1,...,n} such that 


Xk = do aX, a; E€ F. 
iFk 
Conclude from this that any set of vectors, whether finite or infinite, 


is linearly dependent if, and only if, one of its vectors is a finite linear 
combination of the other vectors. 


1.4 Let X be a vector space. Prove that, if A and B are bases of X and 
one of them is finite, then so is the other and they have the same 
number of elements. 


1.5 Show that any finite set of powers of x, {1,x7,x7,...,a" : © € 
I}, is linearly independent. Hence conclude that the infinite set 
{1,a,x7,...: x € I} is linearly independent. 


1.6 If Y is a subspace of the vector space X, prove that dim Y < dim X. 
1.7 Prove that the vectors 


x1 >= (tiisa Tin) 


Xn = (Erias inn) 


where x;; € R, are linearly dependent if, and only if, det(a,;) = 0, 
where det(2;;) is the determinant of the matrix (z£;;). 


1.2 Inner Product Space 


Definition 1.5 


Let X be a vector space over F. A function from X x X to F is called an inner 
product in X if, for any pair of vectors x,y €X, the inner product (x,y) = 
(x,y) € F satisfies the following conditions. 


(i) (x,y) = (y, x) for all x,y EX. 

(ii) (ax+by,z) = a(x,z)+b(y,z) for all a,b E€ F, x,y,z € X. 

(iii) (x, x) > 0 for all x €X. 

(iv) (x,x)=0 © x=0. 

A vector space on which an inner product is defined is called an inner product 
space. 
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The symbol (y,x) in (i) denotes the complex conjugate of (y,x), so that 
(x,y) = (y,x) if X is a real vector space. Note also that (i) and (ii) imply 
(x,ay) = (ay,x) = ā (x,y), 


which means that the linearity property which holds in the first component of 
the inner product, as expressed by (ii), does not apply to the second component 
unless F = R. 


Theorem 1.6 (Cauchy—Bunyakowsky—Schwarz Inequality) 


If X is an inner product space, then 


I(x, y)|? < (x,x) (y,y) forall x,y €X. 


Proof 


If either x = 0 or y = 0 this inequality clearly holds, so we need only consider 
the case where x £0 and y £0. Furthermore, neither side of the inequality 
is affected if we replace x by ax where |a| = 1. Choose a so that (ax,y) is 
a real number; that is, if (x,y) = |(x,y)|e’’, let a = e~"*. Therefore we may 
assume, without loss of generality, that (x,y) is a real number. Using the above 
properties of the inner product, we have, for any real number t, 


0 < (x+ty,x+ty) = (x,x) +2 (x,y) t+ (y,y) t. (1.1) 


This is a real quadratic expression in t which achieves its minimum at t = 
— (x,y) / (y,y) . Substituting this value for t into (1.1) gives 


and hence the desired inequality. 


We now define the norm of the vector x as 


Ixl = v(x, x). 


Hence, in view of (iii) and (iv), ||x|] > 0 for all x €X, and ||x|| = 0 if and only 
if x = 0. The Cauchy—Bunyakowsky—Schwarz inequality, which we henceforth 
refer to as the CBS inequality, then takes the form 


Kx y)| < Ixl llyl] for all x,y €X. (1.2) 


8 1. Inner Product Space 


Corollary 1.7 


If X is an inner product space, then 


Ix+yll < [xl + llyl] for all x,y € X. (1.3) 


Proof 
By definition of the norm, 
2 
Ix+yll" = (x +y,x +y) 


= |x|? + (x,y) + (yx) + ly? 
= ||x\|? + 2Re (x,y) + llyll’. 


But Re (x,y) < |(x,y)| < ||x|| |ly|] by the CBS inequality, hence 


lIx + yl? < Ixl? + 2 Ibell yl + ly? 
= ([lxl| + lly ll)”. 


Inequality (1.3) now follows by taking the square roots of both sides. 


By defining the distance between the vectors x and y to be ||x — y||, we see 
that for any three vectors x,y,z €X, 
Ix —y|| = |x-z+z—-yIl 
< |x —2|| + llz—yll. 
This inequality, and by extension (1.3), is called the triangle inequality, as 
it generalizes a well known inequality between the sides of a triangle in the 
plane whose vertices are the points x,y,z. The inner product space X is now 


a topological space, in which the topology is defined by the norm ||-|| , which is 
derived from the inner product (-,-). 


Example 1.8 


(a) In R” we define the inner product of the vectors 


X =(X1,---,2n), Y=(Y1,---5Yn) 
by 
(x,y) = yi +... + EnYn, (1.4) 
which implies 


Ix] = ft +--+ 23. 
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In this topology the vector space R” is the familiar n-dimensional Euclidean 
space. Note that there are other choices for defining the inner product (x,y), 
such as c(x1Y1 +--+: +2nYn) where c is any positive number, or c)2,y, +- + 
Cn£ZnYn Where c; > 0 for every i. In either case the provisions of Definition 1.5 
are all satisfied, but the resulting inner product space would not in general be 
Euclidean. 


(b) In C” we define 
(z,w) = 21W01 H't + Zn Wn, (1.5) 


for any pair z,w € C”. Consequently, 


lal = yia +--+ lent. 
(c) A natural choice for the definition of an inner product on C([a,b]), by 


analogy with (1.5), is 


b ae 
(fg) = | flade, frg € Cla, b), (1.6) 


i 1/2 
IfI = | J rof al l 


It is a simple matter to verify that the properties (i) through (iv) of the 
inner product are satisfied in each case, provided of course that F= C when 
the vector space is C” or complex C (|a, b]). To check (iv) in Example 1.8(c), 
we have to show that 


so that 


1/2 
| voral =0 & f(x) =0 for all x € [a,b]. 


We need only verify the forward implication (=), as the backward implication 
(<) is trivial. But this follows from a well-known property of continuous, non- 
negative functions: If y is continuous on [a,b], y > 0, and p(w) dx = 0, then 
p = 0 (see [1], for example). Because |f|? is continuous and nonnegative on 
[a,b] for any f € C([a,b]), 


a 2 
I =0> f OPd ES 


In this study, we are mainly concerned with function spaces on which an 
inner product of the type (1.6) is defined. In addition to the topological struc- 
ture which derives from the norm ||- ||, this inner product endows the space with 
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a geometrical structure that extends some desirable notions, such as orthogo- 
nality, from Euclidean space to infinite-dimensional spaces. This is taken up in 
Section 1.3. Here we examine the Euclidean space R” more closely. 


Although we proved the CBS and the triangle inequalities for any inner 
product in Theorem 1.6 and its corollary, we can also derive these inequalities 
directly in R”. Consider the inequality 


(a— b)? = a? —2ab+ b? > 0 (1.7) 


which holds for any pair of real numbers a and b. Let 
ee ee ee E 
VEL to +25 Gare 
If $j- x} #0 and $Y, y; #0, then (1.7) implies 


< + 
—, ` ~ 2 5 P D2 


where the summation over the index j is from 1 to n. After summing on i from 
1 to n, the right-hand side of this inequality reduces to 1, and we obtain 


YD riyi Safe ee: 


This inequality remains valid regardless of the signs of x; and y;, therefore we 


IS aml < fry Dw 


for all x =(£1,...,£n) Æ 0 and y =(m1,..-, Yn) # 0 in R”. But because the 
inequality becomes an equality if either ||x|| or |ly|| is 0, this proves the CBS 


can write 


inequality 
(xy) < lxi llyl] for all x,y € R”. 
From this the triangle inequality ||x + y|] < ||x|| + |ly|| immediately follows. 


Now we define the angle 0 € [0,7] between any pair of nonzero vectors x 
and y in R” by the equation 


(x,y) = [|xl| [ly] cos 8. 


Because the function cos : [0,7] — [—1, 1] is injective, this defines the angle 0 
uniquely and agrees with the usual definition of the angle between x and y in 
both R? and R?. With x 4 0 and y £0, 


(x,y) =0 © cosé=0, 


which is the condition for the vectors x,y € R” to be orthogonal. Consequently, 
we adopt the following definition. 
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Definition 1.9 


(i) A pair of nonzero vectors x and y in the inner product space X is said to 
be orthogonal if (x,y) = 0, symbolically expressed by writing x Ly. A set of 
nonzero vectors V in X is orthogonal if every pair in V is orthogonal. 


(ii) An orthogonal set V C X is said to be orthonormal if ||x|| = 1 for every 
x EV. 


A typical example of an orthonormal set in the Euclidean space R” is 
given by 


en = (0,...,0,1), 
which, as we have already seen, forms a basis of R”. 
In general, if the vectors 
KK) EK (1.8) 


in the inner product space X are orthogonal, then they are necessarily linearly 
independent. To see that, let 


aX, +++: +a,X,=0, a EF, 


and take the inner product of each side of this equation with xk, 1 << k <n. 
In as much as (X;, Xx} = 0 whenever i 4 k, we obtain 


ak (Xk; Xk) = ak ||Xxl|” =0, ke{1, mn} 


=> a,=O0 forall k. 


By dividing each vector in (1.8) by its norm, we obtain the orthonormal set 
{x;/ lxil: 1< i< n}. 


Let us go back to the Euclidean space R” and assume that x is any vector 


in R”. We can therefore represent it in the basis {e;,...,e,} by 
n 
x = 5 Qi€i. (1.9) 
i=] 


Taking the inner product of Equation (1.9) with ex, and using the orthonormal 
property of {e;}, 
(x,e,) =an, ke {1,...,n}. 
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This determines the coefficients a; in (1.9), and means that any vector x in R” 
is represented by the formula 


The number (x,e;) is called the projection of x on e;, and (x,e;) e; is the 
projection vector in the direction of e;. More generally, if x and y 4 0 are any 
vectors in the inner product space X, then (x, y/ ||y||) is the projection of x on 


y, and the vector 
NE y _ (x,y) 
( a) Ivy ly?” 


is its projection vector along y. 


Suppose now that we have a linearly independent set of vectors {x,,..., Xn} 
in the inner product space X. Can we form an orthogonal set out of this set? In 
what follows we present the so-called Gram-Schmidt method for constructing 
an orthogonal set {y1,..., Yn} out of {x;} having the same number of vectors: 
We first choose 

yı = X1- 
The second vector is obtained from x2 after extracting the projection vector of 
xə in the direction of y1, 


X2,Y1 
_ Ray 
lyall 


The third vector is x3 minus the projections of x3 in the directions of yı and yo, 


y2 = X2 


(x3, y1) (x3, y2) 
2 1 2 y2. 
lly lly2ll 


We continue in this fashion until the last vector 


y3 = X3 


(Xn: y1) (Xn: Yn-1) 
2 1 ani 2 Yn-1) 
lyall Ilyn—al| 


Yn = Xn 


and the reader can verify that the set {yi,...,yn} is orthogonal. 


EXERCISES 


1.8 Given two vectors x and y in an inner product space, under what 
conditions does the equality ||x + y||? = ||x||? + |ly||? hold? Can this 
equation hold even if the vectors are not orthogonal? 


1.9 Let x,y € X, where X is an inner product space. 
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1.10 


1.14 


(a) If the vectors x and y are linearly independent, prove that x + y 
and x — y are also linearly independent. 


(b) If x and y are orthogonal and nonzero, when are x + y and 
x — y orthogonal? 


Let y(x) = 1, yo(x) = xz, (xz) = 27, —1 < x < 1. Use (1.6) to 
calculate 

(a) (Y1, P2) 

(b) 

(e) Iv: — gall? 
(d) |291 + 3%2|ļ - 


Determine all orthogonal pairs on [0,1] among the functions ¢, (x) = 
1, pa(£) = x, p(x) = sin 27x, p4(x) = cos 2rx. What is the largest 
orthogonal subset of {Y1, 92, 3, Ya}? 


(Y1, P3) 


Determine the projection of f(x) = cos? x on each of the functions 
filz) =1, folx) = cosa, fs(x) = cos2x, =r Sa < T. 


Verify that the functions %1, Y2, Y3 in Exercise 1.10 are linearly in- 
dependent, and use the Gram-Schmidt method to construct a cor- 
responding orthogonal set. 


Prove that the set of functions {1, x, |x|} is linearly independent on 
{—1, 1], and construct a corresponding orthonormal set. Is the given 
set linearly independent on [0, 1]? 


Use the result of Exercise 1.3 and the properties of determinants to 
prove that any set of functions {f1,..., fn} in C”71(I), I being a 
real interval, is linearly dependent if, and only if, det( fË )) = 0 on 
I, where 1 <i<n,0O<j<n-1. 


Verify that the following functions are orthogonal on [—1, 1]. 


a= a=- d gl Ab rte 


Determine the corresponding orthonormal set. 


Determine the values of the coefficients a and b which make the 
function x? + ax + b orthogonal to both z + 1 and x — 1 on [0,1]. 


Using the definition of the inner product as expressed by Equation 
(1.6), show that ||f|| = 0 does not necessarily imply that f = 0 
unless f is continuous. 
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1.3 The Space L? 


For any two functions f and g in the vector space C((a, b]) of complex contin- 
uous functions on a real interval [a,b], we defined the inner product 


=f f(x (1.10) 


from which followed the definition of the norm 


Ifl= WEA = y S? fa)? de. (1.11) 


As in R”, we can also show directly that the CBS inequality holds in C({a, b]). 
For any f,g € C([a, 6]), we have 


PETRI EEE 


where we assume that ||f|| 40 and ||g|| 4 0. Hence 


” FŒ lE) a ee ie Wee 
aes glee At. If (2)| PLS g(x) |" dex 


= (fl, lal) < IFI lgl]. 
Using the monotonicity property of the integral 


j E f Tdi 


we therefore conclude that 


EDIS Alah < Ial. 


If either ||f|| = 0 or ||g|| = 0 the inequality remains valid, as it becomes an 
equality. The triangle inequality 


lf + gll < WFll + Ilall 
then easily follows from the relation fg + fg = 2Re fg < 2|fg|. 


As we have already observed, the nonnegative number || f — g|| may be re- 
garded as a measure of the “distance” between the functions f, g € C (|a, b]). In 
this case we clearly have || f — g|| = 0 if, and only if, f = g on [a,b]. This is the 
advantage of dealing with continuous functions, for if we admit discontinuous 
functions, such as 


nz) = { Bee aOR (1.12) 


then ||h|| = 0 whereas h # 0. 
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Nevertheless, C (fa, b]) is not a suitable inner product space for pursuing this 
study, for it is not closed under limit operations as we show in the next section. 
That is to say, if a sequence of functions in C([a,b]) “converges” (in a sense 
to be defined in Section 1.4) its “limit” may not be in C(|a,b]). So we need 
to enlarge the space of continuous functions over [a,b] in order to avoid this 
difficulty. But in this larger space, call it X([a, b]), we can only admit functions 
for which the inner product 


b 
(f.9) = i. rorot. 


is defined for every pair f,g € X([a,b]). Now the CBS inequality |(f,g)| < 
|| f|| ||g|| ensures that the inner product of f and g is well defined if || f|| and ||g]| 
exist (i.e., if |f|? and |g|? are integrable). Strictly speaking, this is only true if 
the integrals are interpreted as Lebesgue integrals, for the Riemann integrability 
of f? and g? does not guarantee the Riemann integrability of fg (see Exercise 
1.21); but in this study we shall have no occasion to deal with functions which 
are integrable in the sense of Lebesgue but not in the sense of Riemann. For 
our purposes, Riemann integration, and its extension to improper integrals, is 
adequate. The space X([a,b]) which we seek should therefore be made up of 
functions f such that |f|? is integrable on fa, b]. 


We use the symbol £? (a, b) to denote the set of functions f : [a,b] + C such 
that 


b 
f |f(a)|? dz < o. 


By defining the inner product (1.10) and the norm (1.11) on £?(a,b), we can 
use the triangle inequality to obtain 


laf + Bgll < llafll + |l8gll 
= lol ||fll+ [4] llgll for all f,g € £7(a,b), a, 8 EC, 


hence af + 8g € £L7(a,b) whenever f,g € L7(a,b). Thus £?(a,b) is a linear 
vector space which, under the inner product (1.10), becomes an inner product 
space and includes C((a, b]) as a proper subspace. 

In L?(a, b) the equality || f|| = 0 does not necessarily mean f(a) = 0 at every 
point x € [a,b]. For example, in the case where f(x) = 0 on all but a finite 
number of points in [a,b] we clearly have || f|| = 0. We say that f = 0 pointwise 
on a real interval J if f(x) = 0 at every x € J. If || f|| = 0 we say that f =0 in 
£?(I). Thus the function h defined in (1.12) equals 0 in £?(Z), but not pointwise. 
The function 0 in £?(I) really denotes an equivalence class of functions, each 
of which has norm 0. The function which is pointwise equal to 0 is only one 
member, indeed the only continuous member, of that class. Similarly, we say 
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that two functions f and g in £?(Z) are equal in £L?(J) if || f — g|| = 0, although 
f and g may not be equal pointwise on J. In the terminology of measure theory, 
f and g are said to be “equal almost everywhere.” Hence the space £7(a, b) 
is, in fact, made up of equivalence classes of functions defined by equality in 
L?(a,b), that is, functions which are equal almost everywhere. 


Thus far we have used the symbol L?(a, b) to denote the linear space of func- 
tions f : [a,b] > C such that J? |f(x)|? dx < oo. But because this integral is not 
affected by replacing the closed interval [a,b] by [a,b), (a, b], or (a,b), £? (a,b) 
coincides with £?([a,b)), L2((a,b]) and £?((a,b)). The interval (a,b) need not 
be bounded at either or both ends, and so we have £L7(a,0o), £7(—o0,b) and 
L?(—00, 00) = £?(R). In such cases, as in the case when the function is un- 
bounded, we interpret the integral of |f |? on (a,b) as an improper Riemann 
integral. Sometimes we simply write £? when the underlying interval is not 
specified or irrelevant to the discussion. 


Example 1.10 


Determine each function which belongs to £? and calculate its norm. 


; 1, O0O<a<1/2 
(i f(a) = { 0, 1/2<2<1. 
(ii) f(x) = 1/vz, 0<a<l. 
(iii) f(x) =1/92, 0<x <1. 
(iv) f(z) =1/a, 1< x < œ. 
Solution 
(i) 


2 1 7 1/2 7 1 
IZI = [ Pore da =} 
Therefore f € £7(0,1) and || f|| = 1/V2. 


(ii) 
2 rl ri 
IFI =f -dx = lim —dx = — lim loge = co 
o T e>0t Je T e—=—0+ 
= f ¢ £’ (0,1). 
(iii) 


1 
2 1 : ve 
Il fl =f pde = lim 3(1- 6) =3 


= f € £?°(0,1), |Ifl| = v3. 
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(iv) 


eai]: ; 1 
Ist? = f e= jim - (5-1) =1 


=> f €£7(1,00), |[fl] =1. 


Example 1.11 


The infinite set of functions {1, cos x,sin x, cos 2x, sin2z,...} is orthogonal in 
the real inner product space £2(—7,7). This can be seen by calculating the 


inner product of each pair in the set: 


(1,cosna) = | cosnz dx =0, neéN. 


T 


(sinn) = f sinng dr =0, nEN. 


T 


Tv 
(cos nx, cos mz) = if cos ng cos mg dx 
-F 
T 


= [cos(n — m)x + cos(n + m)a]dx 

= 1 | sinn- mo + —— sin(a + ma] | 
~%5 PA PW E NE E YE TE 
=0, n#m. 


T 
(sin nzg,sin mz) = L sinnasinma dx 


T 


= i T [cos(n — m)x — cos(n + m)a]dx 


=0, nám. 


=f. 


T 
(cosng,sin mz) = I cosngsin mg dz =0, n,m €N, 
= 


because cos ng sin mg is an odd function. Furthermore, 


I1] = v27, 


T 1/2 
\|cos nax|| = | cos? ng de! = y7, 


=R: 


=p 


T 1/2 
||sin na|| = | sin? ng da! = Vn, neéNn. 
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Thus the set 


1 cosx sing cos2a@ sin2zx 
[= VE’ yn’ fn yr of 

which is obtained by dividing each function in the orthogonal set by its norm, 

is orthonormal in £?(—7, 7). 


Example 1.12 
The set of functions 
fee nE Zy pene eye ety 


is orthogonal in the complex space £L?(—7,7), because, for any n 4 m, 


K 
(ein, ame) = J er eimz dy 


=A 


T 
= / EPT ENNE dx 
<I 


1 


i(n —m) 


T 
el(n—m)ax 


=0. 


= 


By dividing the functions in this set by 


1/2 


T 
|e” || = J girada] =A IT, ne Z, 
-T 
we obtain the corresponding orthonormal set 


= n 5 
V 2T 


If p is a positive continuous function on (a,b), we define the inner product 
of two functions f,g € C (a,b) with respect to the weight function p by 


b 
(Fo), = f f(æ)ala)o(e)dz, (1.13) 


and we leave it to the reader to verify that all the properties of the inner 
product, as given in Definition 1.5, are satisfied. f is then said to be orthogonal 
to g with respect to the weight function p if (f, g), = 0. The induced norm 


b 1/2 
Ifl, = / OP ota 


satisfies all the properties of the norm (1.11), including the CBS inequality 
and the triangle inequality. We use £2(a,b) to denote the set of functions f : 
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(a,b) — C, where (a,b) may be finite or infinite, such that || ||, < oo. This is 
clearly an inner product space, and £?(a,b) is then the special case in which 


p=l. 


EXERCISES 


1.19 


1.20 


1.21 


1.22 


1.23 


1.24 


1.25 
1.26 
1.27 


1.28 


1.29 


Prove the triangle inequality || f+ g|| < ||f|] + |lg|| for any f,g € 
L?(a,b). 

Verify the CBS inequality for the functions f(x) = 1 and g(x) = x 
on [0,1]. 

Let the functions f and g be defined on [0,1] by 


f(a) ={ = epee s HOS Bears S041, 


where Q is the set of rational numbers. Show that both f? and g? 
are Riemann integrable on [0,1] but that fg is not. 


Determine which of the following functions belongs to £7(0,00) and 
calculate its norm. 


ey hx i) si as 
(i) e”, (ii) sina, (iii) ES 
If f and g are positive, continuous functions in £?(a, b), prove that 
(f,9) = |I f|||ig|| if, and only if, f and g are linearly dependent. 


Discuss the conditions under which the equality || f + g|| = || fII+Ilgll 
holds in £L7(a, b). 


Determine the real values of a for which z® lies in £7(0, 1). 
Determine the real values of a for which z® lies in £?(1, 00). 


If f € L£7(0,c0) and lim, .. f(x) exists, prove that limz 
f(a) =0. 

Assuming that the interval (a,b) is finite, prove that if f € £7(a,b) 
then the integral f? |f(x)| dz exists. Show that the converse is false 


by giving an example of a function f such that |f] is integrable on 
(a,b), but f ¢ L7(a,b). 


If the function f : [0,00) — R is bounded and |f| is integrable, 
prove that f € £7(0,00). Show that the converse is false by giving 
an example of a bounded function in £L2(0, 00) which is not integrable 
on [0, 00). 
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3 


1.30 In £?(—7,7), express the function sin? x as a linear combination of 


the orthogonal functions {1, cos x, sin x, cos 2a,sin 2x,...}. 


1.31 Define a function f € £?(—1,1) such that (f,2?+1) = 0 and 


II fll = 2. 
1.32 Given p(x) = e`”, prove that any polynomial in x belongs to 
£70.60): 


1.33 Show that if p and ø are two weight functions such that p > o > 0 on 
(a,b), then £2(a,b) © £3(a,b). 


1.4 Sequences of Functions 


Much of the subject of this book deals with sequences and series of functions, 
and this section presents the background that we need on their convergence 
properties. We assume that the reader is familiar with the basic theory of 
numerical sequences and series which is usually covered in advanced calculus. 


Suppose that for each n € N we have a (real or complex) function fn : I — F 
defined on a real interval J. We then say that we have a sequence of functions 
(fn : n € N) defined on I. Suppose, furthermore, that, for every fixed x € J, 
the sequence of numbers (fn(x) : n € N) converges as n — oo to some limit in 
F. Now we define the function f : I — F, for each x € I, by 


f(z) = lim fala). (1.14) 


That means, given any positive number €, there is a positive integer N such 
that 
n>N => |fn(x) - f(a)| <e. (1.15) 


Note that the number N depends on the point x as much as it depends on 
€, hence N = N(e,x). The function f defined in Equation (1.14) is called the 
pointwise limit of the sequence (fn). 


Definition 1.13 


A sequence of functions fn : I — F is said to converge pointwise to the function 
f : I — F, expressed symbolically by 


lim fn = f, lim fn = Í, or fr — f, 


if, for every x € I, limp—se6 fn(@) = f(x). 
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Example 1.14 


1 
(i) Let fa(x) = — sin ng, x € R. In as much as 
n 


1 
lim fa(x)= lim —sinnz =0 for every x € R, 
meoo n> n 


the pointwise limit of this sequence is the function f(z) = 0, x € R. 
(ii) For all x € [0,1], 


ES 0, O<a<l 
fale) =e ->f 1, «=1, 
hence the limit function is 
0, O<a<l 
r=] Caa (1.16) 


as shown in Figure 1.1. 
(iii) For all x € [0, 20), 


Nx 0, x=0 
KOSTE s=] 1, x>0. 
Example 1.15 
For each n € N, define the sequence fn : [0,1] — R by 
0, «x=0 
fala)= 4 n, 0O<e<1f/n 


0, I/n<@<l. 


ya a y 
1- š 
y= fx) 
m 0 1 x 


Figure 1.1 The sequence f,(x) = z”. 
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-o o o H 
1/n l x 


Figure 1.2 


To determine the limit f, we first note that f,(0) = 0 for all n. If x > 0, 
then there is an integer N such that 1/N < x, in which case 


1 1 
Therefore fn — 0 (see Figure 1.2). 


If the number N in the implication (1.15) does not depend on zx, that is, if 
for every £ > 0 there is an integer N = N(e) such that 


n>N => |fn(a)—f(x)|<e forallve TI, (1.17) 


then the convergence f, — f is called uniform, and we distinguish this from 
pointwise convergence by writing 


h>. 
Going back to Example 1.14, we note the following. 
(i) Since 
|fa(x) — 0| = sinne < L for all x € R, 


we see that any choice of N greater than 1/e will satisfy the implication (1.17), 
hence 


1. u 
—sinnz > 0 on R. 
n 


(ii) The convergence z” — 0 is not uniform on [0,1) because the implication 


n> N > |je”-0|=r”<e 
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cannot be satisfied on the whole interval [0, 1) if 0 < £ < 1, but only on (0, 4/2), 
because x” > e for all x € (%/e,1). Hence the convergence fn — f, where f is 
given in (1.16), is not uniform. 


(iii) The convergence 


nx 
1, 0, 
1+ nz te) 
is also not uniform in as much as the inequality 
Nx 1 Z 
= E 
1l+nz 1l+nzaz 


cannot be satisfied for values of x in (0, (1 — ¢)/ne] if0<e< 1. 


Remark 1.16 


1. The uniform convergence fp > f clearly implies the pointwise convergence 
fn — f (but not vice versa). Hence, when we wish to test for the uniform 
convergence of a sequence fn, the candidate function f for the uniform limit of 
fn should always be the pointwise limit. 

2. In the inequalities (1.15) and (1.17) we can replace the relation < by < and 
the positive number € by ce, where c is a positive constant (which does not 
depend on n). 


3. Because the statement | f,(2) — f(x)| < € for all x € I is equivalent to 


sup |fn(a) — f(@)| < £, 
GET 


we see that fa > f on I if, and only if, for every £ > 0 there is an integer N 
such that 


n>N = sup|fn(z) — f(2)| < e, 
cel 


which is equivalent to the statement 


sup |fn(x) — f(z)| - 0 as n— oo. (1.18) 
wel 


Using the criterion (1.18) for uniform convergence on the sequences of 
Example 1.14, we see that, in (i), 
<= 30 
n 


— sin ng 
n 


sup 
xER 


? 


thus confirming the uniform convergence of sinna/n to 0. In (ii) and (iii), we 
have 


sup |v” — f(x)|= sup z” =1+0, 
x€(0,1] x€[0,1) 
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hence neither sequence converges uniformly. 


Although all three sequences discussed in Example 1.14 are continuous, only 
the first one, (sinna/n), converges to a continuous limit. This would seem to 
indicate that uniform convergence preserves the property of continuity as the 
sequence passes to the limit. We should also be interested to know under what 
conditions we can interchange the operations of integration or differentiation 
with the process of passage to the limit. In other words, when can we write 


[im fr(ax)dx = lim fale), or (lim fay = lim fi on I? 
I I 


The answer is contained in the following theorem, which gives sufficient condi- 
tions for the validity of these equalities. This is a standard result in classical 
real analysis whose proof may be found in a number of references, such as [1] 
or [14]. 


Theorem 1.17 


Let (fn) be a sequence of functions defined on the interval I which converges 
pointwise to f on I. 
(i) If fn is continuous for every n, and fp > f, then f is continuous on J. 


(ii) If fn is integrable for every n, I is bounded, and fan > f, then f is integrable 


on I and 
[ teoyae = tim f fa(w)ae. 


(iii) If fn is differentiable on I for every n, I is bounded, and ff converges 
uniformly on J, then f» converges uniformly to f, f is differentiable on J, and 


f> f onI. 


n 


Remark 1.18 


Part (iii) of Theorem 1.17 remains valid if pointwise convergence of fn on I is 
replaced by the weaker condition that f» converges at any single point in I, for 
such a condition is only needed to ensure the convergence of the constants of 
integration in going from ff, to fn. 


Going back to Example 1.14, we observe that the uniform convergence of 
sinna/n to 0 satisfies part (i) of Theorem 1.17. It also satisfies (ii) over any 
bounded interval in R. But (iii) is not satisfied, in as much as the sequence 
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dx 


is not convergent. The sequence (x°) is continuous on [0,1] for every n, but its 


d G : ) 
— | —sinnz } = cosnz 
n 


limit is not. This is consistent with (i), because the convergence is not uniform. 
The same observation applies to the sequence nz/(1 + nx). 


In Example 1.15 we have 


1/n 
[he a ndz=1 forallneN 


tim [fu fn(a)dx = 1, 


[im fn(a)dx = 0. 


This implies that the convergence fn — 0 is not uniform, which is confirmed 
by the fact that 


whereas 


On the other hand, 
1 1 
lim | «2dr = 0 = | lim z”dz, 
0 0 


although the convergence x” — 0 is not uniform, which indicates that not all 
the conditions of Theorem 1.17 are necessary. 


Given a sequence of (real or complex) functions (fn) defined on a real in- 
terval I, we define its nth partial sum by 


Sn(2) = fat) +--+ fala) = Oo fila), 2 EL. 


The sequence of functions (Sn), defined on J, is called an infinite series (of 
functions) and is denoted ` fp. The series is said to converge pointwise on 
I if the sequence (Sn) converges pointwise on I, in which case )> fẹ is called 
convergent. Its limit is the sum of the series 


lim S.(2)= So fle), BEL. 
sess k=1 


Sometimes we shall find it convenient to identify a convergent series with its 
sum, just as we occasionally identify a function f with its value f(a). A series 
which does not converge at a point is said to diverge at that point. The se- 
ries ` fx is absolutely convergent on I if the positive series 5> | fẹ| is pointwise 
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convergent on I, and uniformly convergent on I if the sequence (Sn) is uni- 
formly convergent on J. In investigating the convergence properties of series 
of functions we naturally rely on the corresponding convergence properties of 
sequences of functions, as discussed earlier, because a series is ultimately a 
sequence. But we shall often resort to the convergence properties of series of 
numbers, which we assume that the reader is familiar with, such as the various 
tests of convergence (comparison test, ratio test, root test, alternating series 
test), and the behaviour of such series as the geometric series and the p-series 
(see [1] or [3]). 


Applying Theorem 1.17 to series, we arrive at the following result. 


Corollary 1.19 


Suppose the series X` f,, converges pointwise on the interval J. 

(i) If fn is continuous on I for every n and J` fn converges uniformly on J, 
then its sum X>; fn is continuous. 

(ii) If fn is integrable on I for every n, I is bounded, and X` fn converges 
uniformly, then X57; fn is integrable on I and 


X fn(x)dz = $ | fn(x)de. 
In=1 n=1JI 

(iii) If fn is differentiable on I for every n, I is bounded, and J- fj, converges 
uniformly on J, then J` fn converges uniformly and its limit is differentiable 
on I and satisfies 


This corollary points out the relevance of uniform convergence to manipu- 
lating series, and it would be helpful if we had a simpler and more practical 
test for the uniform convergence of a series than applying the definition. This 
is provided by the following theorem, which gives a sufficient condition for the 
uniform convergence of a series of functions. 


Theorem 1.20 (Weierstrass M-Test) 


Let (fn) be a sequence of functions on J, and suppose that there is a sequence 
of (nonnegative) numbers Mp such that 


lfn(z)|< Ma forallael, nen. 


If X M, converges, then X` fn converges uniformly and absolutely on T. 
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Proof 


Let € > 0. We have 


Ma 
IA 
M 
= 


E fel) — 
k=1 


k=1 k=n+1 


< X Mk foralawel, neNn. 
k=n+1 


Because the series X` Mp is convergent, there is an integer N such that 


[oe 
n> N> DY Mex<e 
k=n+1 


> 2 fk(£) = 3 fka) <e forallrel. 
k=1 


> 
Il 
= 


By definition, this means J` fp is uniformly convergent on J. Absolute conver- 
gence follows by comparison with Mp. 


Example 1.21 
(i) The trigonometric series 
1. 
— sin ng 
ae 
is uniformly convergent on R because 


1 
<5 
n? 


ao sin ng 
n 


and the series )>1/n? is convergent. Because sin ng/n? is continuous on R for 
every n, the function X>; sinnz/n? is also continuous on R. Furthermore, by 
Corollary 1.19, the integral of the series on any finite interval [a,b] is 


b 
œ 1 
f Cs 72 sin ne) dz = > r sinne dx 


= = (cos na — cos nb) 


n=1 
S l 
SiL 
which is convergent. On the other hand, the series of derivatives 


co d 1 Sss 
> T (<= sinne) = > —cosnx 
n=1 


n=1 n 
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is not uniformly convergent. In fact, it is not even convergent at some values 
of x, such as the integral multiples of 27. Hence we cannot write 


dx yan nal Nn 


(ii) By the M-test, both the series 
1, 
5 73 sin nx 
and Se i s 
pa aA (= sin ne) = pai 72 cos nx 


are uniformly convergent on R. Hence the equality 


d X eni] 
=o ~ sinnz = )7 — cosnax 
dx p1 n=1 n 


is valid for all z in R. 


EXERCISES 


1.34 Calculate the pointwise limit where it exists. 


(9) Ta 
(b) yz, 0< z< œ. 


zER. 


(c) sinna, x ER. 

1.35 Determine the type of convergence (pointwise or uniform) for each 
of the following sequences. 
(a) T4gn’ 
(b) Ya, 1/2<a<1. 
(c) 7Wz,0<a<l. 


1.36 Determine the type of convergence for the sequence 


Raed 


n 


O0<ar<2. 


nx, O<a<l1/n 
1 I/n<a <1, 


b 


and decide whether the equality 


im f sa)te = [im toy 


is valid. 
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1.37 Evaluate the limit of the sequence 


1.38 


1.39 


1.40 


1.41 


1.42 


1.43 


1.44 


fns, O<z<1/n 
fulz) = { n(l1—2)/(n—-1), 1/n<x<1, 


and determine the type of convergence. 


Determine the limit and the type of convergence for the sequence 
fn(x) = nx(1 — x°)” on [0,1]. 
Prove that the convergence 


x 


—0 
n+2x 


is uniform on [0,a] for any a > 0, but not on [0, 00). 


Given 


pod ym Kl<e 


|z| >n, 
prove that fn > 0. Evaluate lim [™. f(x)dx and explain why it is 
not 0. 


If the sequence (fn) converges uniformly to f on [a,b], prove that 
lfn — f|, and hence |f — f|? , converges uniformly to 0 on fa, b]. 


Determine the domain of convergence of the series X` fn, where 


(@) fala) = a. 
(b) fale) = 7a. 


If the series J` an is absolutely convergent, prove that ` ansin nz is 
uniformly convergent on R. 


Prove that 
(n+1)m jas 
lim EE ee 
T00 nt 


Use this to conclude that the improper integral 


Conus 
sin Tt 
dx 
0 x 


exists. Show that the integral Jo (sin x| /x)dx = oo. Hint: Use the 
alternating series test and the divergence of the harmonic series 


Yo 1/n. 
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1.45 The series 
co 
D ane” = ao + aye + aor? +: 


is called a power series about the point 0. It is known (see [1]) that 
this series converges in (—R, R) and diverges outside [—R, R], where 


=i 
R= | lim Vian] = lim 


n— co 


an 


> 0. 


An+1 


If R > 0, use the Weierstrass M-test to prove that the power series 
converges uniformly on [—R+e, R—e], where £ is any positive number 
less than R. 


1.46 Use the result of Exercise 1.45 to show that the function 
f(a) =X ane” 
is continuous on (—R, R); then show that f is also differentiable on 
(—R, R) with 
f(a) = È mane 


1.47 From Exercise 1.46 conclude that the power series f(x) = Xpo anx” 
is differentiable any number of times on (—R, R), and that a, = 
f©(0)/n! for all n € N. 


1.48 Use the result of Exercise 1.47 to obtain the following power series 
(Taylor series) representations of the exponential and trigonometric 
functions on R. 


go os 
we 2 n! : 
oo gen 
i pice —1)” 
cos £ > ) (any! 
; oo pent 
sing = OU" Gat iy 


1.49 Use the result of Exercise 1.48 to prove Euler’s formula et? = cos a+ 
isin for all z € R, where i = /—I. 
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1.5 Convergence in L? 


Having discussed pointwise and uniform convergence for a sequence of func- 
tions, we now consider a third type: convergence in L?. 


Definition 1.22 


A sequence of functions (fn) in £?(a, b) is said to converge in L? if there is a 
function f € L*(a,b) such that 


im [fn — fl] =0, (1.19) 
that is, if for every £ > 0 there is an integer N such that 
n> Ns ||fn- fll <e. 
Equation (1.19) is equivalent to writing 
L? 
Ta af; 


and f is called the limit in L? of the sequence (fn). 


Example 1.23 


(i) In Example 1.14(ii) we saw that, pointwise, 


Because £?([0, 1]) = £?([0, 1)), we have 


lz” — Ol| = |f a 


Le 
Therefore x” = 0. 


(ii) The sequence of functions (fn) defined in Example 1.15 by 


1/2 


0, «=0 
fr(t)=4 n, O<a<i/n 
0, I/n<a<l 
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also converges pointwise to 0 on [0,1]. But in this case, 


1 
Ifa — Ol? = I P(a)de 


1/n 
=} ndz 
0 


=n foralneN. 


Thus || fn —0|| = Vn ~ 0, which means the sequence fan does not converge to 
OinL?, 


This last example shows that pointwise convergence does not imply conver- 
gence in L°. Conversely, convergence in L? cannot imply pointwise convergence, 
because the limit in this case is a class of functions (which are equal in £? but 
not pointwise). It is legitimate to ask, however, whether a sequence that con- 
verges pointwise to some limit f can converge to a different limit in £2. For 
example, can the sequence (f,) in Example 1.23(ii) converge in £? to some 
function other than 0? The answer is no. In other words, if a sequence con- 
verges both pointwise and in £7, then its limit is the same in both cases. More 
precisely, we should say that the two limits are not distinguishable in £? as 
they belong to the same equivalence class. 


On the other hand, uniform convergence fa > f over I implies pointwise 
convergence, as we have already observed, and we now show that it also implies 
2 
Fn £, f provided the sequence (fn) and f lie in £?(T) and T is bounded: Because 
fn — f Š 0, it is a simple matter to show that |f, — f|? + 0 (Exercise 1.41). 
By Theorem 1.17(ii), we therefore have 


tim [fn — FI? = im fy |fn(@) — Fie) de 
= fy lim | fn(x) — f(a)? de = 0. 


The condition that f belong to £7(J) is actually not needed, as we shall discover 
in Theorem 1.26. 


Example 1.24 
We saw in Example 1.21 that 
n 1 i es) 
Sahla) = > pa sinks > Sia) = > qa sinks, xeER, 
k=1 k=1 


hence the function S(x) is continuous on [—7,7]. Moreover, both Sn and S' lie 
in L?(—r, T) because each is uniformly bounded above by the convergent series 
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5 1/k?. Therefore S, converges to S in £2(—7,7). Equivalently, we say that 
the series > sin ka/k? converges to Xp; sin ka/k? in £L?(—7,7) and write 


ne esaa l 
lim ` Rp sink = >> pz sin ka in £°(—1, T). 
k=1 k=1 


The series ` sin ka/k, on the other hand, cannot be tested for convergence 
in £? with the tools available, and we have to develop the theory a little further. 
First we define a Cauchy sequence in £? along the lines of the corresponding 
notion in R. This allows us to test a sequence for convergence without having 
to guess its limit beforehand. 


Definition 1.25 


A sequence in £? is called a Cauchy sequence if, for every £ > 0, there is an 
integer N such that 
m,n > N => | fn — fml| < €. 


Clearly, every convergent sequence (fn) in £? is a Cauchy sequence; for if 


Tri 2 f, then, by the triangle inequality, 
lfa — fall < lfa = FI lfm — fl, 


and we can make the right-hand side of this inequality arbitrarily small by 
taking m and n large enough. The converse of this statement (i.e., that every 
Cauchy sequence in £? converges to some function in £?) is also true and 
expresses the completeness property of L?. 


Theorem 1.26 (Completeness of £?) 
For every Cauchy sequence (fn) in £? there is a function f € £? such that 


hag 


There is another theorem which states that, for every function f € £? (a,b), 


there is a sequence of continuous functions (fn) on [a,b] such that fn 2 f. In 
other words, the set of functions C([a, b]) is dense in £?(a, b) in much the same 
way that the rationals Q are dense in R, keeping in mind of course the different 
topologies of R and £?, the first being defined by the absolute value |-| and the 
second by the norm |j-||. For example, the £?(—1,1) function 


0, -l<a<0O 
ra=] 1, 0<2<l, 
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which is discontinuous at x = 0, can be approached in the £? norm by the 
sequence of continuous functions 


0, —1 <x < -—l1/n 
fnh= 4 nz+1, —l/n<xr<0 
1 O0<a<l. 


? 


This is clear from 


j 1/2 
Jim, Ha — A jim, Lf O-O ae 


= lim 1/V3n =0. 


Needless to say, there are many other sequences in C([—1,1]) which converge 
to f in £?(—1,1), just as there are many sequences in Q which converge to the 
irrational number v2. 


As we shall have occasion to refer to this result in the following chapter, we 
give here its precise statement. 


Theorem 1.27 (Density of C in £7) 


For any f € £7(a,b) and any € > 0, there is a continuous function g on [a,b] 
such that || f — g|| < €. 


The proofs of Theorems 1.26 and 1.27 may be found in [14]. The space 
L? is one of the most important examples of a Hilbert space, which is an inner 
product space that is complete under the norm defined by the inner product. It 
is named after David Hilbert (1862-1943), the German mathematician whose 
work and inspiration did much to develop the ideas of Hilbert space (see [7], 
vol. I). Many of the ideas that we work with are articulated within the context 
of L?. 


Example 1.28 


Using Theorem 1.26, we can now look into the question of convergence of the 
sequence S;,(x) = ) p1 sin kz/k in L?(—r, 7). Noting that 


2 
n 


1 
X =sinks 
k=m+1 k 


|| Su(z) — Sin(z)|? = 


>, m<n, 
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we can use the orthogonality of {sin kz : k € N} in £?(—7,7) (Example 1.11) 


to obtain 


n 1 
X =sinks 
k=m+1 k 


n 1l 3 no 
= YY > lsinke| =a X 
k=m+1 k2 k=m+1 k? 
Suppose £ > 0. Since X` 1/k? is convergent, we can choose N so that 
NE 
n>m = 
E m+1 k? 


k= 
> ||Sn(£) — Sm(2)|| < €. 


Thus $`}; sin kz/k is a Cauchy sequence and hence converges in L?(—r, 7), 


although we cannot as yet tell to what limit. 


Similarly, the series ` cos kx/k converges in L?(—r, 7), although this series 
diverges pointwise at certain values of x, such as all integral multiples of 27. 


This section was devoted to convergence in £? because of its importance to 


the theory of Fourier series, but we could just as easily have been discussing 
convergence in the weighted space ie. Definitions 1.22 and 1.25 and Theorems 
1.26 and 1.27 would remain unchanged, with the norm ||-|| replaced by ||-||, and 


convergence in £L? by convergence in L?. 


EXERCISES 


1.50 Determine the limit in £? of each of the following sequences where 


it exists. 
(a) fala) = Ya, 0<xr<1. 


nz, O<a<l1/n 


Ono, 
(c) falx) =na(l—a2)", O<a<1. 


1.51 Test the following series for convergence in L?. 
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1.52 If (fn) is a sequence in £?(a,b) which converges to f in £?, show 
2 
that (fn,g) © (f, g) for any g € £7(a,b). 


2 

1.53 Prove that ||| f|] — |lg|l| < || f — g|| , and hence conclude that if fn 5 
f then || fall > IFI- 

1.54 If the numerical series > |an| is convergent, prove that Y |a,|” is 


also convergent, and that the series }* ansin ng and J` an cos ng are 
both continuous on [—7, 7]. 


1.55 Prove that if the weight functions p and ø are related by p > ø on 
(a,b), then a sequence which converges in £L7(a, b) also converges in 
LEGO): 


1.6 Orthogonal Functions 


Let 
{1; V2, 3,---} 


be an orthogonal set of (nonzero) functions in the complex space £L?, which 
may be finite or infinite, and suppose that the function f € £? is a finite linear 
combination of elements in the set {y,}, 


f= May; a €C. (1.20) 
i=l 
Taking the inner product of f with y,, 


(f, Pp) = aklly? foral k=1,...,n, 


we conclude that 


— (f, Yr) 


= L, 
IlPall 
and the representation (1.20) takes the form 


Qk 


In other words, the coefficients a, in the linear combination (1.20) are deter- 
mined by the projections of f on y;. In terms of the corresponding orthonormal 


set {Vk = Pr/ llPnll}: p 
f= 2 (F, Vk) Vrs 


and the coefficients coincide with the projections of f on Yẹ. 
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Suppose, on the other hand, that f is an arbitrary function in £? and that 
we want to obtain the best approximation of f in £7, that is, in the norm ||-|| , 
by a finite linear combination of the elements of {pp}. We should then look for 
the coefficients a; which minimize the nonnegative number 


|r wy AkPk 
k=1 
We have 
n 2 n n 
l- D akp = (F- D ape- axe) 
k=1 k=1 k=1 


= |f? -2 $ Reade (fn) + Z lol? vel? 
k=1 k=1 


= Wl? So eee 


fa livell? 


_ (f, Px) sens 
+ 2 eel -2R + 
2 lasl flez wa el 
(F Pk) j 


= IfI -y i +Y lol? 


k=1 k=1 


2 
lexi 


Since the coefficients a, appear only in the last term 


2 
dE liol 
k=1 


we obviously achieve the minimum of ||f -X= one, ||” , and hence of 
IF — Xr=1 Ae Pell , by choosing 


(f, Pk) 3 
lell? 


Ak — 


lleell? 
This minimum is given by 
(F, Pr) KS, Pr) 
l-> 3 ZP = IA? -5 eL Ei, (1.21) 
1 llel ta (ly ia 
This yields the relation, 
~ Cf, Pe) 
yas Ss WP? 
ra (ly Ty 
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Since this relation is true for any n, it is also true in the limit as n — oo. The 
resulting inequality 


S eM <i ple, (1.22) 


fa eal 


known as Bessel’s inequality, holds for any orthogonal set {pp : k € N} and 
any f € L?. 
In view of (1.21), Bessel’s inequality becomes an equality if, and only if, 


l-5 Doh a 
k=1 


3 


Pell 


or, equivalently, 


f = (f, Pr) Pk in i 
rat lell 


which means that f is represented in £? by the sum >>", QkYp, where ay, = 


(E, Pr) / pl? 


Definition 1.29 


An orthogonal set {,,:n € N} in £? is said to be complete if, for any f € L?, 


Thus a complete orthogonal set in £? becomes a basis for the space, and 
because £? is infinite-dimensional the basis has to be an infinite set. When 
Bessel’s inequality becomes an equality, the resulting relation 


Wl? = 5 Meee (1.23) 


lea 


is called Parseval’s relation or the completeness relation. The second term is 
justified by the following theorem, which is really a restatement of Definition 
1.29. 


Theorem 1.30 


An orthogonal set {y,, : n € N} is complete if, and only if, it satisfies Parseval’s 
relation (1.23) for any f € L?. 
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Remark 1.31 


1. Given any orthogonal set {y,, : n € N} in £?, we have shown that we obtain 
the best £?-approximation 


n 
D QAkPk 
k=1 


of the function f € L? by choosing ap = (f, pp) /llg,ll?, and this choice is 
independent of n. If {y,,} is complete then the equality f = >, any, holds 
in £?. 

2. When the orthogonal set {y,,} is normalized to {w, = Y%/ ||~x||}, Bessel’s 
inequality takes the form 


MF eed? <I? 
k=1 

and Parseval’s relation becomes 
fl? = 2 CRAIR 


3. For any f € L?, because ||f|| < oo, we conclude from Bessel’s inequality that 
(f, Wn) — 0 whether the orthonormal set {w,,} is complete or not. 


Parseval’s relation may be regarded as a generalization of the theorem of 
Pythagoras from R” to £?, where || f |? replaces the square of the length of the 
vector, and X> |(f, Wn) |? represents the sum of the squares of its projections 
on the orthonormal basis. That is one reason why £? is considered the natural 
generalization of the finite-dimensional Euclidean space to infinite dimensions. 
It preserves some of the basic geometric structure of R”, and the completeness 
property (Theorem 1.26) guarantees its closure under limiting operations on 
Cauchy sequences. 


EXERCISES 


1.56 If J is any positive number, show that {sin(naa/l) : n € N} and 
{cos(n7a/l) : n € No} are orthogonal sets in £7(0,1). Determine the 
corresponding orthonormal sets. 


1.57 Determine the coefficients c; in the linear combination 
cy + Cosin Tg + c3sin 27x 


which give the best approximation in £?(0, 2) of the function f(x) = 
x,0<a4<2. 
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1.58 


1.59 


1.60 


1.61 


1.62 
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Determine the coefficients a; and b; in the linear combination 
ao + acos x + bysin x + agcos 2x + bosin 2x 


which give the best approximation in £L2(—7, 7) of f(x) = |x|, =r < 
WT: 


Let p1, p2, and p3 be the three orthogonal polynomials formed from 
the set {1, 2,27} by the Gram-Schmidt method, where —1 < x < 1. 
Determine the constant coefficients in the second-degree polynomial 
aıpı(x) + agpo(x) + a3p3(x) which give the best approximation in 
£?(-1,1) of e”. Can you think of another polynomial p of degree 2 
which approximates e” on (—1, 1) in a different sense? 


Assuming that 


_— 1 (2n — 1)r 
= 5 < < 2 
x z2 2 uA cos 5 x, O<a#<2, 


use Parseval’s identity to prove that 


_ 1 
= 96 —. 
> (2n — 1)4 


Define a real sequence (ap) such that X` a? converges and Y` a, di- 
verges. What type of convergence can the series X` an cosna, =r < 
x < r have? 


Suppose {fn : n € N} is an orthogonal set in £7(0,1), and let 


[fn(x) + fr(—2)], 
[fn(x) — fn(—2)], —-l<a<l, 


Prlz) = 


Palt) = 


1 
2 
1 
2 


be the even and odd extensions, respectively, of fn from [0,1] to 
[—1, J]. Show that the set {y,,} U {Yn} is orthogonal in £?(—1,1). If 
{fn} is orthonormal in £2(0,1), what is the corresponding orthonor- 
mal set in £?(—I,1)? 


2 


The Sturm—Liouville Theory 


Complete orthogonal sets of functions in £? arise naturally as solutions of 
certain second-order linear differential equations under appropriate boundary 
conditions, commonly referred to as Sturm—Liouville boundary-value problems, 
after the Swiss mathematician Jacques Sturm (1803-1855) and the French 
mathematician Joseph Liouville (1809-1882), who studied these problems and 
the properties of their solutions. The differential equations considered here 
arise directly as mathematical models of motion according to Newton’s law, 
but more often as a result of using the method of separation of variables to 
solve the classical partial differential equations of physics, such as Laplace’s 
equation, the heat equation, and the wave equation. 


2.1 Linear Second-Order Equations 


Consider the ordinary differential equation of second order on the real interval 
I given by 

ao(x)y" + ar(x)y! + a2(x)y = f(a), (2.1) 
where ag, @1, a2, and f are given complex functions on J. When f = 0 on 
I, the equation is called homogeneous, otherwise it is nonhomogeneous. Any 
(complex) function y € C?(T) is a solution of Equation (2.1) if the substitution 
of y by ¢ results in the identity 


ao(x)p" (x) + alx)’ (x) + a2(x) p(x) = f(x) for all x € I. 
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If we denote the second-order differential operator 


al) H ai (2) = + a2(x) 


by L, then Equation (2.1) can be written in the form Ly = f. The operator 
L is linear, in the sense that 


Lhapt ep) = ca Ly + coly 


for any functions y, € C(I) and any constants c1,c2 € C, hence (2.1) is 
called a linear differential equation. Unless otherwise specified, all differential 
equations and operators that we deal with are linear. A fundamental property 
of linear homogeneous equations is that any linear combination of solutions of 
the equation is also a solution; for if y and w satisfy 


Ly =0, Ly =0, 
then we clearly have 
Lhapt+ev) = ca Lo + c2 Ly = 0 


for any pair of constants cı and c2. This is known as the superposition principle. 


If the function ao does not vanish at any point on J, Equation (2.1) may be 
divided by ao to give 


y” + q(x)y' + r(ax)y = g(x), (2.2) 


where q = aı/aọ, r = a2/ao, and g = f/aọ. Equations (2.1) and (2.2) are 
clearly equivalent, in the sense that they have the same set of solutions. Equa- 
tion (2.1) is then said to be regular on I; otherwise, if there is a point c € I 
where ag(c) = 0, the equation is singular, and c is then referred to as a singular 
point of the equation. 


According to the existence and uniqueness theorem for linear equations (see 
(6]), if the functions q,r, and g are all continuous on J and 2p is a point in J, 
then, for any two numbers € and n, there is a unique solution y of (2.2) on 
I such that 

y(to) =,  g'(zo)=n. (2.3) 


Equations (2.3) are called initial conditions, and the system of equations (2.2) 
and (2.3) is called an initial-value problem. 


Here we list some well-known properties of the solutions of Equation (2.2), 


which may be found in many standard introductions to ordinary differential 
equations. 
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1. The homogeneous equation 
y” +q(z)y'+r(x)y=0, vel, (2.4) 


has two linearly independent solutions y;(x) and y2(x) on I. A linear com- 
bination of these two solutions 


C1Y1 + C2Y2, (2.5) 


where cı and c2 are arbitrary constants, is the general solution of (2.4); that 
is, any solution of the equation is given by (2.5) for some values of cı and c2. 
When c1 = c2 = 0 we obtain the so-called trivial solution 0, which is always 
a solution of the homogeneous equation. By the uniqueness theorem, it is 
the only solution if € = 7 = 0 in (2.3). 


2. If yp(#) is any particular solution of the nonhomogeneous Equation (2.2), 
then 
Yp + C1Y1 + C2Y2 


is the general solution of (2.2). By applying the initial conditions (2.3), the 
constants cı and c2 are determined and we obtain the unique solution of 
the system of Equations (2.2) and (2.3). 


3. When the coefficients q and r are constants, the general solution of Equa- 
tion (2.4) has the form 
cye™!* + ene", 


where mı and mg are the roots of the second degree equation m?+qm+r = 
0 when the roots are distinct. If mı = mz = m, then the solution takes 
the form cye™* + coxe™”, in which the functions e™” 
linearly independent. 


ME 


and xe” are clearly 


4. When ao(x) = x”, ai(x) = az, and a2(x) = b, where a and b are constants, 
the homogeneous version of Equation (2.1) becomes 


ay” + axy’ + by = 0, 
which is called the Cauchy—Euler equation. Its general solution is given by 
cya"? + ega™, 


where mı and mg are the distinct roots of m? + (a — 1)m + b = 0. When 
Mı = M2 = m, the solution is c1ıx™ + cox™ log x. 


5. If the coefficients q and r are analytic functions at some point xo in the 
interior of J, which means each may be represented in an open interval 
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centered at xo by a power series in (x — xo), then the general solution of 
(2.4) is also analytic at xo, and is represented by a power series of the form 


Co 


X cn(a@ — £0)”. 


n=0 
The series converges in the intersection of the two intervals of convergence 
(of q and r) and I. Substituting this series into Equation (2.4) allows us to 
express the coefficients cn, for all n € {2,3,4,...}, in terms of co and c1, 
which remain arbitrary. 


With I = [a,b], the solutions of Equation (2.1) may be subjected to bound- 
ary conditions at a and b. These can take one of the following forms: 


(i) yo =E, y(e)=n, cE {a,b}, 
(ii) yfa)=€,  yb)=n, 
(iii) y'(a) =E, y (b) =n. 


When the boundary conditions are given at the same point c, as in (i), they are 
often referred to as initial conditions as mentioned earlier. For the purpose of 
obtaining a unique solution of Equation (2.1), the point c need not, in general, 
be one of the endpoints of the interval J, and can be any interior point. But 
in this presentation, as in most physical applications, boundary (or initial) 
conditions are always imposed at the endpoints of I. The forms (i) to (iii) of 
the boundary conditions may be generalized by the pair of equations 


ayla) + azy (a) + agy(b) + asy’ (b) = £, (2.6) 
Byy(b) + B2y' (b) + B3y(a) + Bay’ (a) =n, 


where a; and 8; are constants that satisfy peer |a;| > 0 and par |G,| > 0, 
that is, such that not all the a; or 8; are zeros. The system of Equations (2.1), 
(2.6), and (2.7) is called a boundary-value problem. 

The boundary conditions (2.6) and (2.7) are called homogeneous if € = = 
0, and separated if a3 = a4 = ba = p4 = 0. Separated boundary conditions, 
which have the form 


ayla) +azy'(a)=€, By y(b) + Bay (b) = n, (2.8) 


are of particular significance in this study. Another important pair of homoge- 
neous conditions, which result from a special choice of the coefficients in (2.6) 
and (2.7), is given by 


y(a)=y(b),  y'(a) = y'(b). (2.9) 


Equations (2.9) are called periodic boundary conditions. Note that periodic 
conditions are coupled, not separated. 
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Definition 2.1 
For any two functions f,g € C! the determinant 
f(x) g(x) | / ; 
W(f,g)(“) = = f(x)g (x) —g(a)f (x 
(falay=| E 90) |= roe) - e)a) 
is called the Wronskian of f and g. The symbol W (f, g)(x) is sometimes ab- 
breviated to W(x). 


The Wronskian derives its significance in the study of differential equations 
from the following lemmas. 


Lemma 2.2 
If yı and y2 are solutions of the homogeneous equation 
y” +q(x)y’ +r(x)y =0, zed, (2.10) 


where q € C(I), then either W (y1, y2)(x) = 0 for all x € I, or W(y1, yo) (x) # 0 
for any x € I. 


Proof 

From Definition 2.1, 
W' = yyy — you 

Because yı and y2 are solutions of Equation (2.10), we have 
yi + ay + ry = 0, 
Y2 + qyz + ryz = 0. 


Multiplying the first equation by y2, the second by yı, and subtracting, yields 


1 IEN 1 IN 
Ya — Y2Y1 + q4(Y1Y2 — Y2Y1) = 0 
=> W'+qw=0. 


Integrating this last equation, we obtain 
W(x) = cexp (-/ a(t) ; eel, (2.11) 


where c is an arbitrary constant. The exponential function does not vanish for 
any (real or complex) exponent, therefore W(x) = 0 if, and only if, c = 0. 
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Remark 2.3 


With q € C(I), the expression (2.11) implies that both W and W” are contin- 
uous. 


Lemma 2.4 


Any two solutions yı and y2 of Equation (2.10) are linearly independent if, and 
only if, W(y1, y2)(x) £0 on I. 


Proof 


If yı and y2 are linearly dependent, then one of them is a constant multiple of 
the other, and therefore W (y1, y2)(x) = 0 on I. Conversely, if W (y1, y2)(x) = 0 
at any point in J, then, by Lemma 2.2, W (y1, y2)(£) = 0. From the properties 
of determinants, this implies that the vector functions (y1, y1) and (y2, y4) are 
linearly dependent, and hence yı and yz are linearly dependent. 


Remark 2.5 


We used the fact that yı and y2 are solutions of Equation (2.10) only in the 
second part of the proof, the “only if” part. That is because the Wronskian of 
two linearly independent functions may vanish at some, but not all, points in 
I. Consider, for example, x and x? on [—1, 1]. 


Example 2.6 


The equation 
y” +y=0 (2.12) 
has the two linearly independent solutions, sinx and cos x. Hence the general 
solution is 
y(x) = ccos x + csin x. 


Note that 
W (cosa, sin x) = cos? x + sin? x = 1 for all x € R. 


If Equation (2.12) is given on the interval [0,7] subject to the initial condi- 
tions 
y(0)=0,  y'(0)=1, 


we obtain the unique solution 


y(x) = sinz. 
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But the homogeneous boundary conditions 


y(0)=0,  y'(0)=0, 


yield the trivial solution y = 0, as would be expected. 
On the other hand, the boundary conditions 


y(0)=0, — y() = 9, 
do not give a unique solution because the pair of equations 
y(0) = cı cos0 + c2 sin 0 = cı = 0 
y(t) = cy cosm + cg sina = —c = 0 


does not determine the constant c2. The determinant of the coefficients in this 
system is 

cos0 sin0 
cosm sina 


= 0. 


This last example indicates that the boundary conditions (2.6) and (2.7) do 
not uniquely determine the constants cı and cz in the general solution in all 
cases. But the (initial) conditions 


y(to) =, = (to) =n, 


always give a unique solution, because the determinant of the coefficients in 
the system of equations 


c1yi(xo) + c2y2(£0) = € 


C1Y9(%0) + c2y3(z0) = N 
is given by 


yı(zo) y2(z0) | _ a 
yhoo) ghlao) |7 7 02o), 


which cannot vanish according to Lemma 2.4. 


In general, given a second-order differential equation on [a,b], the separated 
boundary conditions 


ary(a) + azy (a) = €, 
Byy(b) + Bay' (b) = n, 
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imposed on the general solution y = cy, + C242, yield the system 


ci[aryi(a) + aay; (a)] + c2lary2(a) + azys(a)] = £, 
c1[8y1(b) + Boy} (0)] + c2[B1y2(b) + Boyo(b)] = n. 


Hence the constants cı and c2 are uniquely determined if, and only if, 


(a1y2 + azy3)(a) 
(Byy2 + Boys) (d) 


(aiyı + a2y})(a) 


(Biy1 + Boy) (0) (2.13) 


#0. 


EXERCISES 


2.1 


2.2 
2.3 


2.4 


2.5 


2.6 


2.7 


Find the general solution of each of the following equations. 
(a) y” — 4y' + Ty = e. 
(b) ay” — y' = 327. 


i 


+ 3ay +y=a-1. 


(c) ay 
Use the transformation t = \/z to solve ry” + y'/2 — y = 0. 


Use power series to solve the equation y” + 2zry’ + 4y = 0 about the 
point x = 0. Determine the interval of convergence of the solution. 


Solve the initial-value problem 


(xy’-y)=0,  -l<a@<l, 


y'(0) =1. 


1 
y nET 


0, 


If %1, P2, and pz are solutions of Equation (2.4), prove that 


Pı P2 P3 
pi p2 pz |=0. 
vi P2 P3 


If yı and y2 are linearly independent solutions of Equation (2.4), 
show that 
_ YY = youll 


_ yey = Wye = 
Wu, y2) 


W (y1, y2) i 


For each of the following pair of solutions, determine the correspond- 
ing differential equation of the form apy” + aiy’ + azy = 0. 
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(a) e~*cos 2a, e~*sin 2x. 
(b) x, 27", 
(c) 1, log z. 


2.2 Zeros of Solutions 


It is not necessary, nor is it always possible, to solve a differential equation of 
the type 
y” +q(x)y' +r(x)y =0, cel, (2.14) 


explicitly in order to study the properties of its solutions. Under certain con- 
ditions, the parameters of the equation and its boundary conditions determine 
these properties completely. In particular, such qualitative features of a solution 
as the number and distribution of its zeros, its singular points, its asymptotic 
behaviour, and its orthogonality properties, are all governed by the coefficients 
q and r and the given boundary conditions. We can therefore attempt to de- 
duce some of these properties by analysing the effect of these coefficients on the 
behaviour of the solution. In this section we shall investigate the effect of q and 
r on the distribution of the zeros of the solutions. Orthogonality is addressed 
in the next two sections. 


In Example 2.6 we found that the two solutions of y” + y = 0 on R had an 
infinite sequence of alternating zeros distributed uniformly, given by 


< ae Pee le 
ae 25753 ' 


where {nz : n € Z} are the zeros of sin x and {7/2 + nm: n € Z} are those of 
cos x. We shall presently find that this is not completely accidental. 

A function f : I > C is said to have an isolated zero at xo € I if f(xo) = 0 
and there is a neighbourhood U of x9 such that f(x) 4 0 for all x € IAU\{z0}. 


Lemma 2.7 


If y is a nontrivial solution of the homogeneous Equation (2.14), then the zeros 
of y are isolated in T. 


Proof 


Suppose y(xo) = 0, where y is a solution of (2.14). If y’(xo) = 0, then y 
is identically 0 by the uniqueness theorem. If y/(ao) 4 0 then, because y’ is 
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continuous on J, there is a neighbourhood U of zg where y 4 0 on UNIT 
(see [1]). Consequently y is either strictly increasing or strictly decreasing on 
UAI. 


Theorem 2.8 (Sturm Separation Theorem) 
If yı and y2 are linearly independent solutions of the equation 
y” +qlæ)y' +r(z)jy=0, cel, 


then the zeros of yı are distinct from those of y2, and the two sequences of zeros 
alternate; that is, yı has exactly one zero between any two successive zeros of 
y2, and viceversa. 


Figure 2.1 The alternating zeros of yı and ye. 


Proof 


Because yı and y2 are linearly independent, their Wronskian 


W (y1, y2) (£) = yr (a) yo (x) — y2(x)yı (x) 


does not vanish, and has therefore one sign on J (Lemma 2.4). Note first that yı 
and y2 cannot have a common zero, otherwise W would vanish there. Suppose 
xı and x2 are two successive zeros of y2. Then 


W(a1) (a1) y5(a1) Æ 0, 
W (a2) (x2)yo(x2) 0, 


and the numbers yi(%1), yi(v2), y$(w1), and y$(a#2) are all nonzero. Because 
yh is continuous on J, x; has a neighborhood U, where the sign of y4 does 
not change, and similarly there is a neighborhood U2 of x2 where y4 does not 
change sign. But the signs of y4 in U, NI and U2 N I cannot be the same, for 


=y 
= yı 


2.2 Zeros of Solutions 51 


if y2 is increasing on one then it has to be decreasing on the other. For W (x) 
to have a constant sign on J, yı(xı) and yi (x2) must therefore have opposite 
signs; hence yı, being continuous, has at least one zero between x; and z2. 
There cannot be more than one such zero, for if x3 and x4 are two zeros of yı 
which lie between x; and x2, we can use the same argument to conclude that 
Yə vanishes between x3 and x4. But this contradicts the assumption that xı 


and x2 are consecutive zeros of yo. 


Corollary 2.9 


If two solutions of y” + q(x)y’ + r(x)y = 0 have a common zero in J, then they 
are linearly dependent. 


In order to study the distribution of zeros of Equation (2.14), it would be 
much more convenient if we could get rid of the middle term qy’ by transforming 


the equation to 
u” + p(x)u = 0. (2.15) 
To that end we set 


so that 

y (x) = ul(a)o(x) + u(x)v (z), 

y” (x) = u” (x)o(a) + 2u' (x)v(x) + u(x)v" (x). 
Substituting into Equation (2.14) yields 


vu” + (2v + qv)ul + (vw + qu’ + rv)u = 0. 


Thus we obtain (2.15) by choosing 2v’ + qv = 0, which implies 


ise (-3 | aoar) , (2.16) 


1 2 1 1 
pla) = r(x) = 38) - 3da). 
The exponential function v never vanishes on R, thus the zeros of u coincide 
with those of y, and we may, for the purpose of investigating the distribution 


of zeros of Equation (2.14), confine our attention to Equation (2.15). 


Theorem 2.10 (Sturm Comparison Theorem) 
Let y and w be nontrivial solutions of the equations 


y” + ri(x)y = 0, 
u” + ro(x)u = 0, cel, 
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respectively, and suppose rı(x) > rə(x) for all x € I. Then ọ has at least one 
zero between every two consecutive zeros of p, unless r(x) = rə(x) and y isa 
constant multiple of w. 


Proof 


Let xı and x2 be any two consecutive zeros of w on J, and suppose that y has 
no zero in the open interval (x1, £2). Assume that both y and w are positive 
on (21,22), otherwise change the sign of the negative function. Since y’ and 
wv’ are continuous, it follows that ~'(2,) > 0 and ~'(x2) < 0, and therefore the 
Wronskian of y and ¢~ satisfies 


W (a1) = (21)0"(a1) >0,  W(z2) = 9(a2)¥' (x2) < 0. (2.17) 
But because 
W' (x) = p(x)" (a) — p" (x) v(2) 
= [ri(x) — ro(x)]p(a) v(x) > 0 for all x € (x1, £2), 


W is an increasing function on (#1, £2). This contradicts Equation (2.17) unless 
rı(x)—rə(x) =0 and W(x) = 0, in which case y and w are linearly dependent 
(by Lemma 2.4). 


Corollary 2.11 


Let y be a nontrivial solution of y” + r(x)y = 0 on I. If r(x) < 0 then ọ has 
at most one zero on I. 


Proof 


If the solution ọ has two zeros on J, say xı and x2, then, by Theorem 2.10, the 
solution w(2) = 1 of u” = 0 must vanish on (21,22), which is impossible. 


Example 2.12 
(i) Any nontrivial solution of y” = 0 on R is a special case of 
p(z) = cT + c2, 


which is represented by a straight line, and has at most one zero. 


ii) The equation y” — y = 0 has the general solution 
y y 8 


p(z) = ce” + ene”, LER. 
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If cı and c2 are not both zero, then y(x) Æ 0 for any x € R unless cp = — c1, 
in which case y has one zero at x = 0. 


(iii) The solution of y” + y = 0 is given by 
p(x) = cicos x + cosin x = asin(x — b), 


where a = yc? + c3 and b = — arctan (c1 /c2). If a 4 0, p has an infinite number 
of zeros given by £n =b+na, n€Z. 


A nontrivial solution of 
y"+r(ajyy=0, wel, (2.18) 


is called oscillatory if it has an infinite number of zeros, as in Example 2.12(iii). 
According to Theorem 2.10, whether this equation has oscillatory solutions de- 
pends on the function r. If r(x) < 0, the solutions cannot oscillate by Corollary 
2.11; but if 
r(x) > k? > 0, sel, 
for some positive constant k, then any solution of (2.18) on J has an infinite 
number of zeros distributed between the zeros of any solution of y” + k?y = 0, 
such as asin k(x — b), which are given by 
nT 


n=b+—. 
x tor 


Thus every subinterval of I of length m/k has at least one zero of Equation 
(2.18), and as k increases we would expect the number of zeros to increase. 
This, of course, is clearly the case when r is constant. 

From the Sturm separation theorem we also conclude that, if the interval I 
is infinite and one solution of the equation 


y” + q(x)y' + r(x)y = 0 


oscillates, then all its solutions oscillate. 


Example 2.13 


The equation 


H 1 / v? 
y +-y+ {1 5) ¥=9, 0<4<ow, (2.19) 
£ 


is known as Bessel’s equation of order v, and is the subject of Chapter 5. Using 
formula (2.16) to define u = yzy, Bessel’s equation under this transformation 


takes the form 
j 1 — 4p? 
u +|1+ u=0. (2.20) 
4x? 
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By comparing Equations (2.20) and u” + u = 0, we see that 


1-4? ( >1 if0<vK< 
aaia <5 if0<v< 1/2 


4z? <1 ifv>1/2. 
Applying Theorem 2.10 we therefore conclude: 


(i) If 0 < v < 1/2 then, in every subinterval of (0, co) of length 7, any solution 
of Bessel’s equation has at least one zero. 


(ii) If v > 1/2 then, in every subinterval of (0,00) of length 7, any nontrivial 
solution of Bessel’s equation has at most one zero. 


(iii) If v = 1/2, the distance between successive zeros of any nontrivial solution 
of Bessel’s equation is exactly 7. 


EXERCISES 


2.8 Prove that any nontrivial solution of y” + r(x)y = 0 on a finite 
interval has at most a finite number of zeros. 


2.9 Prove that any nontrivial solution of the equation 


k 
y” + sy =0 
x 
on (0,00) is oscillatory if, and only if, k > 1/4. Hint: Use the substi- 
tution x = é. 


2.10 Use the result of Exercise 2.9 to conclude that any nontrivial solution 
of the equation y” + r(a)y = 0 on (0,00) has an infinite number of 
zeros if r(x) > k/x? for some k > 1/4, and only a finite number if 
r(x) < 1/4a?. 


2.11 Let y be a nontrivial solution of y” + r(x)y = 0 on (0,00), where 
r(x) > 0. If y(x) > 0 on (0,a) for some positive number a, and if 
there is a point zo € (0,a) where y’(xo) < 0, prove that y vanishes 
at some point zı > £o. 


2.12 Determine which equations have oscillatory solutions on (0, 00): 
(a) y” + (sin? z + 1)y = 0. 
(b) y” = x°y =0. 
1 
(c) y” + ~y =0. 
x 


2.13 Find the general solution of Bessel’s equation of order 1/2, and de- 
termine the zeros of each independent solution. 
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2.14 If lim, f(x) = 0, prove that the solutions of y” + (1+ f(x))y = 0 
are oscillatory. 


2.15 Prove that any solution of Airy’s equation y” +zxy = 0 has an infinite 
number of zeros on (0,00), and at most one zero on (—oo,0). 


2.3 Self-Adjoint Differential Operator 


Going back to the general form of the linear second-order differential Equation 
(2.1), in slightly modified notation, 


p(a)y" + q(a)y! + r(x)y = 0, (2.21) 
we now wish to investigate the orthogonality properties of its solutions. This 


naturally means we should look for the C? solutions of (2.21) which lie in L?. 
Equation (2.21) can be written in the form 


Ly = 0, 


where 


L= pa) <5 + de) Ë + r(x), (2.22) 
is a linear differential operator of second order, and y lies in £?(I) N C7 (1), 
which is a linear vector space, being the intersection of two such spaces with 
the same operations. 
To motivate the discussion, it would help at this point to recall some of the 
notions of linear algebra. A linear operator in a vector space X is a mapping 


A:X>X 
which satisfies 
A(ax + by) = aAx+bAy foralla,bEF, x,y EX. 


If X is an inner product space, the adjoint of A, if it exists, is the operator A’ 


which satisfies 
(Ax, y) = (x, A’y) for all x,y € X. 


If A’ = A, then A is said to be self-adjoint. 
If X is a finite-dimensional inner product space, such as C” over C, we know 


that any linear operator is represented, with respect to the orthonormal basis 
{e;:1<i<n}, by the matrix 


Q11 `? Gin 


A=] : = (aij). 
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Its adjoint is given by 


Qi © An 
A = ; : = (a,,) = AT, 


Gin pe ünn 


where A is the complex conjugate of A, 


Qi > Gin 
A= F 
Gn1 eae ünn 
and AT is its transpose, 
Qil °t? Qni 
AT = 
Qin > ann 


A complex number a is called an eigenvalue of A if there is a nonzero vector 
x € X such that 
Ax = ax, 


and x in this case is called an eigenvector of the operator A associated with 
the eigenvalue a. From linear algebra (see, for example, [11]) we know that, if 
A is a self-adjoint (or Hermitian) matrix, then 


(i) The eigenvalues of A are all real numbers. 
(ii) The eigenvectors of A corresponding to distinct eigenvalues are orthogonal. 
(iii) The eigenvectors of A form a basis of X. 


In order to extend these results to the space £7, our first task is to obtain 
the form of the adjoint of the operator 


L: L? ACD > L?) 


defined by Equation (2.22), where we assume, to start with, that the coefficients 
p,q, and r are C? functions on I. Note that C? (I) N L? (T) = C?(L) when I is 
a closed and bounded interval. Denoting the adjoint of L by L’, we have, by 
definition of L’, 


(Lf,g) =(f,L'g) forall f,g € C?(I) NL7(J). (2.23) 
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We set I = (a,b), where the interval J may be finite or infinite, and use inte- 
gration by parts to manipulate the left-hand side of (2.23) in order to shift the 
differential operator from f to g. Thus 


b 
(Lf, 9) = S (pf" + af! +rf)gdx 


b b b 
= pf'gl, — f f(pg)' de + qafa? — f Fla) de + f frgdx 
b b b 
= [pf'9 — f(a lÈ + [f(@3)"dx + afal? — [flqg)'da + f frade 


= (f, (B9)” — (Gg)! + Fo) + P(F'9 — FF) + la- pn, 


where the integrals are considered improper if (a,b) is infinite or any of the 
integrands is unbounded at a or b. Note that the right-hand side of the above 
equation is well defined if p € C? (a,b), q € C! (a,b), and r € C (a,b). The last 
term, of course, is to be interpreted as the difference between the limits at a 
and b. We therefore have, for all f, g € £L?(1) Nn C?(D), 


(LF, g) = (F, L*9) + P'a — £9) + la- PASTE, (2.24) 


where 


The operator 


d? d 


Shae? (2p — q)— + (p" -g +r) 


is called the formal adjoint of L. L is said to be formally self-adjoint when 
L* = L, that is, when 


p=p, 2p - Gq =4, p’-@+r=r. 


These three equations are satisfied if, and only if, the functions p,q, and r are 
real and q = p’. In that case 


Lf=pf" +p f +rf 
= (pf’)' +rf. 


Thus, when L is formally self-adjoint, it has the form 
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and Equation (2.24) is reduced to 


(Lf,9) = (F, L9) + v(f'9- fa. (2.25) 


Comparing Equations (2.23) and (2.25) we now see that the formally self- 
adjoint operator L is self-adjoint if 

pfa- FF), =0 for allf,g € C?(I) N LPU). (2.26) 
It is worth noting at this point that, when q = p’, the term p” — g’ in the 
expression for L* drops out, hence the continuity of p” and q’ is no longer 
needed. 


We are interested in the eigenvalue problem for the operator —L, that is, 
solutions of the equation 
Lu+ Au = 0. (2.27) 


When u = 0 this equation, of course, is satisfied for every value of A. When 
u # 0, it may be satisfied for certain values of A. These are the eigenvalues 
of —L. Any function u 4 0 in C? N L? which satisfies Equation (2.27) for some 
complex number A is an eigenfunction of —L corresponding to the eigenvalue 
A. We also refer to the eigenvalues and eigenfunctions of —L as eigenvalues and 
eigenfunctions of Equation (2.27). Because this equation is homogeneous, the 
eigenfunctions of —L are determined up to a multiplicative constant. When 
appropriate boundary conditions are added to Equation (2.27), the resulting 
system is called a Sturm—Liouville eigenvalue problem, which is discussed in the 
next section. Clearly, — L is (formally) self-adjoint if, and only if, L is (formally) 
self-adjoint. The reason we look for the eigenvalues of —L rather than L is that, 
as it turns out, L has negative eigenvalues when p is positive (see Example 2.16 
below). The following theorem summarizes the results we have obtained thus 
far, as it generalizes properties (i) and (ii) above from finite-dimensional space 
to the infinite-dimensional space L? N C?. 


Theorem 2.14 


Let L: £L?(a,b) NC? (a,b) — L?(a,b) be a linear differential operator of second 
order defined by 


Lu = p(x)u" + q(x)u'+r(z)u, xE (a,b), 


where p € C?(a,b), q € C! (a,b), and r € C(a,b). Then 

(i) L is formally self-adjoint, that is, L* = L, if the coefficients p, q, and r are 
real and q = p’. 

(ii) L is self-adjoint, that is, L’ = L, if L is formally self-adjoint and Equation 
(2.26) is satisfied. 
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(iii) If L is self-adjoint, then the eigenvalues of the equation 
Lut Au = 0 


are all real and any pair of eigenfunctions associated with distinct eigenvalues 
are orthogonal in £L?(a, b). 


Proof 
We have already proved (i) and (ii). To prove (iii), suppose  € C is an eigen- 
value of —L. Then there is a function f € £?(a,b) N C?(a,b), f Æ 0, such 
that 
Lf+Af=0 
= APP = Off) =- (LFF). 


Because L is self-adjoint, 


= (Lf, f) == (FLP) = (FAP) = AMF. 
Hence À || fI? = AII f ||? . Because || f || 40, A = A. 
If u is another eigenvalue of —L associated with the eigenfunction g € 
L? (a,b) N C?(a, b), then 
Af, 9) = — (LF, 9) = — (F, Lg) = u (f, 9) 
(A— u) (f,9) = 0 
AF u => (fg) =0. 


Remark 2.15 


As noted earlier, when q = p’ in the expression for L, the conclusions of this 
theorem are valid under the weaker requirement that p’ be continuous. 


Example 2.16 


The operator —(d?/dx?) is formally self-adjoint with p = —1 and r = 0. To 
determine its eigenfunctions in C?(0,7), we have to solve the equation 


u” + du = 0. 


Let us first consider the case where A > 0. The general solution of the 
equation is given by 


u(x) = cicos VAx + cosin VA. (2.28) 
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Under the boundary conditions 
u(0) = u(t) = 0 
Equation (2.26) is satisfied, so — (d? /dx?) is, in fact, self-adjoint. Applying the 
boundary conditions to (2.28), we obtain 
u(0) =c, =0 
u(t) = cosin VAT = 0 VAT = nt A=n?, neEN. 


Thus the eigenvalues of —(d?/dzx?) are given by the sequence 
(n? : n € N) =(1,4,9,...), 
and the corresponding eigenfunctions are 
(sinang : n € N) = (sin z,sin 2z,sin3z,...). 


Observe that we chose c2 = 1 for the sake of simplicity, since the boundary 
conditions as well as the eigenvalue equation are all homogeneous. We could 


also divide each eigenfunction by its norm ||sinnz|| = 4/ fo sin? x dx = /7/2 


to obtain the normalized eigenfunctions 


(V2/rsinne Ine N) : 


If A = 0, the solution of the differential equation is given by cız + c2, and if 
A < 0 it is given by cycosh J/—Aax + cosinh V—Az. In either case the application 
of the boundary conditions at x = 0 and x = 7 leads to the conclusion that cı = 
co = 0. But the trivial solution is not admissible as an eigenfunction, therefore 
we do not have any eigenvalues in the interval (—oo, 0]. The eigenvalues Àn = n? 
are real numbers, in accordance with Theorem 2.14, and the eigenfunctions 


Un(x) = sin nz are orthogonal in £7(0,7) because, for all n 4 m, 


T 


=0. 


(0) 


A : . 1 [sin(n—m)x sin(n + m)x 
sin ng sin madx = 
0 2 n-m n+m 


Sometimes it is not possible to determine the eigenvalues of a system exactly, 
as the next example shows. 


Example 2.17 


Consider the same equation 
u” + d\u=0 
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on the interval (0,1), under the separated boundary conditions 
u(0) = 0, hu(l) + u'(l) = 0, 


where h is a positive constant. 
It is straightforward to check that this system has no eigenvalues in (—oo, 0]. 
When A > 0 the general solution is 


u(a) = cicos VÀz + cosin VAT. 
The first boundary condition implies cı = 0, and the second yields 
co(hsin VAL + VÀ cos VAL) = 0. 
Because c2 cannot be 0, otherwise we do not get an eigenfunction, we must 


have 
hsin VAL + VAcos VAL = 0. 


Because sin VÀl and cos VAl cannot both be 0, it follows that neither of them 
is. Therefore we can divide by cos VAL to obtain 


tan VAL = a 


Setting a = VAl, we see that a is determined by the equation 


tana = — 


Al’ 
This is a transcendental equation which is solved graphically by the points of 
intersection of the graphs of 


a 
y=tana and y= TR 


as shown by the sequence (a,,) in Figure 2.2. 
The eigenvalues and eigenfunctions of the problem are therefore given by 


Qn \ 2 
m= (FE) > 
Un(x) = sin(anz/l), 0<a<l, neéeN, 


and the conclusions of Theorem 2.14(iii) are clearly satisfied. 


The following example shows that, if p’ 4 q on I, the operator L in Theorem 
2.14 may be transformed to a formally self-adjoint operator when multiplied 
by a suitable function. 
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Figure 2.2 


Example 2.18 


Let 
ee ere xE I= [a,b] 

dx? dx , u™ 
where p € C?(I) does not vanish on I, q € C(I), and r € C(I). We may 
assume, without loss of generality, that p(x) > 0 for all x € I. If q # p’, we can 
multiply L by a function p and define the operator 

2 
L=pL= ppf + pas + pr. 


L will be formally self-adjoint if 


pq = (pp) = p'p + pp’. 


This is a first-order differential equation in p, whose solution is 


p(x) = oka (S La) , (2.29) 


where c is a constant. Note that p is a C? function which is strictly positive on 
I. It reduces to a (nonzero) constant when q = p’, that is, when L is formally 
self-adjoint. 


The result of Example 2.18 allows us to generalize part (iii) of Theorem 
2.14 to differential operators which are not formally self-adjoint. If Lu = pu” + 
qu’ + ru, where p > 0 and q Æ p', the eigenvalue equation 


Lu+àu=0 
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can be multiplied by the positive function p defined by (2.29) to give the equiv- 
alent equation 
pLu + Apu = 0, (2.30) 


where pL is now formally self-adjoint. With p > 0, Equation (2.26) is equivalent 
to 


- ib 
ppl f'g — FI) la = 0. 
This makes the operator pL self-adjoint. If u € Lr is an eigenfunction of L 
associated with the eigenvalue À, we can write 


A lul? = (Apu, u) 
(—pLu, u) 


II 


= (u,—pLu) 
= (u, Apu) 


2 
A llullg > 


II 


which implies that À is a real number. Furthermore, if v € Ls is another 
eigenfunction of L associated with the eigenvalue u, then 


(à = u) (u,v), = À (pu, v) — u (pu, v) 
= (Apu, v) — (u, upv) 
= (—pLu, v) — (u, —pLv) = 0 


2 


because pL is self-adjoint. Thus, if A # u, then u is orthogonal to v in £%. 


We have therefore proved the following extension of Theorem 2.14(iii): 


Corollary 2.19 


If L: L?(a,b) N C?(a,b) — L£?(a, bd) is a self-adjoint linear operator and p is a 
positive and continuous function on [a, b], then the eigenvalues of the equation 


Lu + Apu = 0 


are all real and any pair of eigenfunctions associated with distinct eigenvalues 
are orthogonal in £2 (a,b). 


Remark 2.20 


1. In this corollary the eigenvalues and eigenfunctions of the equation Du + 
Apu = 0 are, in fact, the eigenvalues and eigenfunctions of the operator — p71 L. 
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2. Suppose the interval (a,b) is finite. Since the weight function p is continuous 
and positive on [a,b], its minimum value a and its maximum value ( satisfy 


0<a< plx) << %. 


This implies 

va llull < lull, < V8 llull; 
and therefore ||u||, < œ if, and only if, |u|] < oo. The two norms are said to be 
equivalent, and the two spaces L(a, b) and £?(a, b) clearly contain the same 
functions, although they have different inner products. 
3. Nothing in the proof of this corollary requires L to be a second-order differ- 
ential operator. In fact, the result is true for any self-adjoint linear operator on 
an inner product space. 


Example 2.21 
Find the eigenfunctions and eigenvalues of the boundary-value problem 
xy" + xy’ + ày =0, l<a<b, (2.31) 
y(1) = y(b) = 0. 


Solution 


Equation (2.31) is a Cauchy—Euler equation whose solutions have the form x”. 
Substituting into the equation leads to 


m(m—1)+m+rA=0 
> m = tiv). 


Assuming à > 0, we have 
giV> = eiA lgs — cos(VAlog x) + isin(VÀlog z), 
and the general solution of Equation (2.31) is given by 


y(x) = cicos( VA log x) + czsin( VÀ log £). 
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Applying the boundary conditions (2.32), 
y(1)=c,=0, — -y(b) = cgsin(V/Alog b) = 0. 
Because we cannot have both constants cı and cy vanish, this implies 


sin(VA log b) = 0 
=> V\logb = nr, neN. 


Thus the eigenvalues of the boundary-value problem are given by the sequence 


of positive real numbers 
nr \? 
An = IOa i} N ’ 
(a 5) ie 


and the corresponding sequence of eigenfunctions is 
Yn(x) = sin( log z) : 
Observe here that the differential operator 
ae d 
a —s + 4a— 
dx? dx 


is not formally self-adjoint, but it becomes so after multiplication by the weight 


1 Pal 1 
p(x) = (| it) TT 
T ı t x 


The resulting operator is then 


2 
L La (a£) 


A dr dz E 


function 


The eigenfunctions (yn : n € N) of this problem are in fact eigenfunctions of 
—p7+L = —zL, which are orthogonal in £?(1, 6), not £?(1,b). Indeed, 


b 
MT NT 1 
m: Yn) = sin | —— 1 in| — ] —d 
(Ym, Y T f sin = oge sin ( 2 oge) z412 


l T 
= æ | sin mÅ sin n€dé 
T Jo 
=0 fralm#n. 


We leave it as an exercise to show that this problem has no eigenvalues in 
the interval (—oo, 0]. 
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EXERCISES 
, d d : : 
2.16 Given L = — | p— | +r, prove the Lagrange identity 
dx \ dx 
uLv — vLu = [p(w — vu'))’. 
The integral of this identity, 
b 
J (uLv — vLu)dzx = [pluv — vu’)? , 
is known as Green’s formula. 
2.17 Determine the eigenfunctions and eigenvalues of the differential op- 
erators 
da 
(a) — : C?(0,c0) > C(0, 00). 
dz? 
d2 
(b) Ta ` L£? (0, 00) N C?(0, 00) — £? (0, ov). 

2.18 Prove that the differential operator —(d?/dx?) : £L7(0,)NC?(0, 7) > 
L?(0, 7) is self-adjoint under the boundary conditions u(0) = u!(7) = 
0. Verify that its eigenvalues and eigenfunctions satisfy Theorem 
2.14(iii). 

2.19 Put each of the following differential operators in the form p(d?/dx?) 
+ p'(d/dx) + r, with p > 0, by multiplying by a suitable weight 
function: 

d2 
2 
(a) x im * > 0, 
d? d 
R 
(b) dx2 "dr * l 
d? 2d 
— — x 0 
(c) dx? aoa 
d? d 
(d) a ter +(z?— A), z>0. 

2.20 Determine the eigenvalues and eigenfunctions of u” +u = 0 on (0,1) 
subject to the boundary conditions u(0) = 0, hu(l)+u’' (1) = 0, where 
h <0. 

2.21 Put the eigenvalue equation u” + 2u’ + Au = 0 in the standard form 


Lu + Apu = 0, where L is self-adjoint, then find its eigenvalues and 
eigenfunctions on [0,1] subject to the boundary conditions u(0) = 
u(1) = 0. Verify that the result agrees with Corollary 2.19. 


2.4 The Sturm-—Liouville Problem 67 


2.22 Determine the eigenfunctions and eigenvalues of the equation x7u!’ + 
Au = 0 on [1, e], subject to the boundary conditions u(1) = u(e) = 0. 


2.4 The Sturm—Liouville Problem 


In Section 2.3 we saw how a linear differential equation may be posed as an 
eigenvalue problem for a differential operator in an infinite-dimensional vector 
space. The notion of self-adjointness was defined by drawing on the analogy 
with linear transformations in finite-dimensional space, and we saw how the 
first two properties of a self-adjoint matrix, namely that its eigenvalues are 
real and its eigenvectors orthogonal, translated (almost word for word) into 
Theorem 2.14. The third property, that its eigenvectors span the space, is in 
fact an assertion that the matrix has eigenvectors as well as a statement on 
their number. This section is concerned with deriving the corresponding result 
for differential operators, which is expressed by Theorems 2.26 and 2.28. 


Let L be a formally self-adjoint operator of the form 


d d 
L= — | p(x)— } +r(2). (2.33) 
dx dx 
The eigenvalue equation 
Lu+ Ap(x)u=0, x€ (a,b), (2.34) 


subject to the separated homogeneous boundary conditions 


ayu(a) + asu’ (a) = 0, lail + læ2| > 0, (2.35) 
B,u(b) + Bau’ (b) =0, [bıl + |82| > 0, (2.36) 


where a; and ĝ; are real constants, is called a Sturm—Liouville eigenvalue prob- 


lem, or SL problem for short. 

Because L is self-adjoint under these boundary conditions, we know from 
Corollary 2.19 that, if they exist, the eigenvalues of Equation (2.34) are real 
and its eigenfunctions are orthogonal in £2(a,b). When the interval (a,b) is 
bounded and p does not vanish on [a,b], the system of equations (2.34) to 
(2.36) is called a regular SL problem, otherwise it is singular. In the regular 
problem there is no loss of generality in assuming that p(x) > 0 on [a,b]. The 
solutions of the SL problem are clearly the eigenfunctions of the operator —L/p 
in C? which satisfy the boundary conditions (2.35) and (2.36). 

Now we show that the regular SL problem not only has solutions, but that 
there are enough of them to span Lea, b). This fundamental result is proved 
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in stages. Assuming, for the sake of simplicity, that p(x) = 1, we first construct 
Green’s function for the operator L under the boundary conditions (2.35) and 
(2.36). This choice of p avoids some distracting details without obscuring the 
general principle. We then use Green’s function to arrive at an integral expres- 
sion for L7}, the inverse of L. The eigenfunctions of — L, 


Lu + Au = 0, 
coincide with those of — L71, 
L'u + pu =0, 


and the corresponding eigenvalues are related by u = 1/X. We arrive at the 
desired conclusions by analysing the spectral properties of L71, that is, its 
eigenvalues and eigenfunctions. 

We shall have more to say about the singular problem at the end of this 
section. 


2.4.1 Existence of Eigenfunctions 
Green’s function for the self-adjoint differential operator 


,d 
TP 


d 
=P dx 


ae at 


under the boundary conditions 


ayu(a) + azu’ (a) = 


Bulb) + bau (b) = 


, [61l + |62| > 0, 


is a function 
G : [a,b] x [a,b] — R 


with the following properties. 
1. G is symmetric, in the sense that 
G(x, £) = G(E, x) for all x, € [a,b], 
and G satisfies the boundary conditions in each variable x and €. 


2. G is a continuous function on the square [a,b] x [a,b] and of class C? on 
[a, b] x [a, b]\{(x, £) : £x = £}, where it satisfies the differential equation 


Ly G(x, €) = p(£)Gra (£, €) + p'(2)Ge(a, £) + r(x)G(a, €) = 0. 
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3. The derivative 0G/Ox has a jump discontinuity at x = € (see Figure 2.3) 
given by 


aG AG. _ 1a, , ac 
age 8) = JE 3) = Jim 3x (£ j ô, €) Ox (£ 6, £) 


ee (2.37) 


p 
es 


Figure 2.3 Green’s function. 


We assume that the homogeneous equation Lu = 0, under the separated 
boundary conditions (2.35) and (2.36), has only the trivial solution u = 0. This 
amounts to the assumption that 0 is not an eigenvalue of —L. There is no 
loss of generality in this assumption, for if À is a real number which is not an 
eigenvalue of —L then we can define the operator 


L 


II 
S&S 
> 

II 
RS 


where F(x) = r(x) + À. Now, in as much as 


Lu + àu = Lu + (à — Aju, 


we see that À is an eigenvalue of — L associated with the eigenfunction u if and 
only if A — dis an eigenvalue of —L associated with the same eigenfunction u. 
Because À is not an eigenvalue of —L, 0 cannot be an eigenvalue of L. That 
there are real numbers which are not eigenvalues of —L follows from the next 
lemma. 
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Lemma 2.22 


The eigenvalues of —L are bounded below by a real constant. 


Proof 


For any u € C?({a, b]) which satisfies the boundary conditions (2.35) and (2.36), 
we have 


b 
EmTe / [-(pv' yt — r ful? 


b 
=| [p |u"? — r |ul?|dx + p(a)u!(a)u(a) — p(b)u' (b)u(b) 


a 2 b Qı 
=| [p|u’|” — r jul"]da + p(b) =u? (b) — pla) —w?(a). 


b2 Q2 
If 8 = 0 then the second boundary condition implies u(b) = 0 and the second 
term on the right-hand side drops out. Similarly the third term is 0 if ag = 0. 
The case where u is an eigenfunction of — L with boundary values u(a) = u(b) = 
0 immediately yields 


b b 
Aull? = f p(x) KoPe- f r(x) \u(a)|” dx 


a 


> — lull? max{|r(x)| : a < £ < b}, (2.38) 


hence £ = — max{|r(a)|: a <x < by is a lower bound of A. 


On the other hand, if u satisfies the separated boundary conditions a;u(a)+ 
azu’ (a) = 0, B,u(b) + Bau’ (b) = 0, then the following dimensional argument 
shows that there can be no more than two linearly independent eigenfunctions 
of —L with eigenvalues less than £. 

Seeking a contradiction, suppose —L has three linearly independent eigen- 
functions u1, u2, and ug with their corresponding eigenvalues A;, Az, and A3 all 
less than @. We can assume, without loss of generality, that the eigenfunctions 
are orthonormal. Since 


ayu;(a) + agu;(a) = 0, 
By ui(b) + Byu;(b) = 0, 


we see that each of the six vectors (u;(a),u;(a)) and (u;(b), ui(b)) lies in a 
one-dimensional subspace of R?. Therefore the three vectors u; = (u;(a), u; (a), 
u;(b), u;(b)) lie in a two-dimensional subspace of R*. We can therefore form a 
linear combination c,u 4 + C2U2 + c3u3, where not all the coefficients are zeros, 


such that 


i=1,2,3, 


C1 Uy + C2U2 + C3U3 = O. 
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But this implies 


CU, (a) + coU2(a) + c3u3(a) = 


cu (b) + c2u2(b) + c3u3(b) = 


? 


The function 
v(x) = cru (x) + c2u2(x) + czuz (x) 


is therefore an eigenfunction of —L which satisfies v(a) = v(b) = 0, and conse- 
quently its eigenvalue is bounded below by £. But this is contradicted by the 
inequality 


(—Lv,v) = à |a|? + Ag c2]? + Ag leal? < Llc? + leal? + lesl® = 2 lull? 


Now we construct Green’s function for the operator L under the boundary 
conditions (2.35) and (2.36), 


ayu(a) + agu’(a) = 0, lail + l@2| > 0, 
Bulb) + Bau (b) = 0, |61] + |62] > 0. 


According to a standard existence theorem for second-order differential equa- 
tions [6], Lu = 0 has two unique (nontrivial) solutions vı and v2 such that 


vi(a)=a2, (a) =—an, 
v2(b) = Bo, v9(b) = —ĝı. 
Thus v; satisfies the first boundary condition at x = a, 
avı la) + aovi (a) = 0, 


and və satisfies the second condition 


B102(b) + Bav3(b) = 0. 


Clearly vı and v2 are linearly independent, otherwise each would be a (non- 
zero) constant multiple of the other, and both would then satisfy Lu = 0 
and the boundary conditions (2.35) and (2.36). But this would contradict the 
assumption that 0 is not an eigenvalue of L. 

Now we define 


where 
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is a nonzero constant. This follows from the fact that neither p nor W vanishes 
on [a,b], and the Lagrange identity (see Exercise 2.16) 

[pluvivi — vi vo)| = vı Lvg — veLv, = 0 on fa, dB}. 


It is now a simple matter to show that all the properties of G(€, x) listed above 
are satisfied. Here we verify Equation (2.37). Differentiating (2.39) we obtain 


OG OG 
OE (x, a + ô) 


1 
(x, a — 6) = z[v (x)vz(x + ô) — vi (x — 4)v2(x)], 
where ô > 0. In view of (2.40) and the continuity of vi and v5, this expression 


tends to 1/p(x) as 6 > 0. 
Now we define the operator T on C([a, b]) by 


b 
. J: G(æ, £) f()aé, (2.41) 


and show that the function Tf is of class C?({a,b]) and solves the differential 
equation Lu = f. Rewriting (2.41) and differentiating, 


z b 
z i G(x, £) f(Edé + I G(x, f(a, 
G b 
=| a Gr (a, €) f(E)dg, 
TH" j=j Goal, E) f(OME + Gelz, 17) f (27) 


b 
+ fe Gealt, €) f(€)d€ — Ge (2, £+) f(a), 


where we have used the continuity of G and f at € = x to obtain (T fY (x). 
Because 


1 1 
Gz(a,2~) — Gex, £+) = —[vi(2™ )v3 (x) — v}(2)v2(2")] = —, 
p(x) 
by the continuity of vı and v2, we also have 
yr" l f(x) 
TVE = f CEDO f Cale ONO EE, 
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from which we conclude that Tf lies in C?({a,b]) and satisfies 
L(T'f)(x) = p(x)(Tf)"(x) + p'(2)\(TF)' (x) + r(x) (TF) (x) 
x b 
= | LCDI | LG EOE + fle) 


in view of the fact that LzG(x, £) = 0 for all € A zx. 

From property (1) of G it is clear that Tf also satisfies the boundary con- 
ditions (2.35) and (2.36). On the other hand, if u € C?([a,b]) satisfies the 
boundary conditions (2.35) and (2.36), then we can integrate by parts and use 
the continuity of p, u, u’ and the properties of G to obtain 


x b 
T(Lu)(2) = f G(æ, €) Lu dé + | G(w, €)Lu(é)dé 


= p(E) [u'(E)G(a, £) — uli) Gelz, Ela +f u(E)LeG(a, dg 


a 


b 
+P [uw OG.) -OGE f OLGE 


b 


at 
= p(f)u(g)Ge(a, €)|,- + PCE) [W (EGC, £) — u(E)Ge(a, Ela 
u(x), 
where we have used Equation (2.37) and the fact that u and G satisfy the 
separated boundary conditions. The operator T therefore acts as a sort of 
inverse to the operator L, and the SL system of equations 
Lu + àu = 0, 

ayu(a) + azu'(a) = 0, 

Byu(b) + Bgu'(b) = 0, 
is equivalent to the single eigenvalue equation 
Tu = pu, 


where u = —1/X. In other words, u is an eigenfunction of the SL problem 
associated with the eigenvalue À if and only if it is an eigenfunction of T 
associated with the eigenvalue —1/A. We shall deduce the spectral properties 
of the regular SL problem from those of the integral operator T. 


The following example illustrates the method we have described for con- 
structing Green’s function in the special case where the operator L is d?/dx? 
on the interval [0, 1]. 
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Example 2.23 
Let 


u” + du =0, 0<a2<l, 
u(0) = u(1) = 0. 


We have already seen in Example 2.16 that this system of equations has only 
positive eigenvalues. The general solution of the differential equation is 


v(x) = eycos VAr + c2 sin VA. 


Noting that ay = 8, = 1 and ag = ə = 0, we seek the solution vı which 
satisfies the following initial conditions at x = 0, 


v1(0) = ag = 0, vi (0) = ~a; = -1. 
These conditions imply 
Cc, = 0, VXc9 = -1. 
Hence 


re 
v1 (a2) = a sin VAs. 


Similarly, the solution vg which satisfies the conditions 
v2(1) = B, =0, v9(1) =-f$,=-1, 


is readily found to be 


2 sin VA cos VÀ . 
va(x) = Tx cos V Ar — x sin VÀz. 


Now Green’s function, according to (2.39), is given by 


| [| otu (Evlr), OSETILI, 
AS { eum); Csaest< 1, 


The constant c can be computed using (2.39), and is given by sin V\/VX. Thus 


SEE Asin VÀz — sin X cos VAs) , O<E<T<], 
G= VAsinyd 
; sin VAr ; : 
ran (os Asin VAE — sin X cos VE) , O<r<é<l. 
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Note that 


Ge(x, £t) — Gelz, 27) = sin VAr (cos Acos VArt + sin VXsin Vat) 
sin VÀ 


= cos VAT- (cos Asin VAT — sin VÀ cos VAs) 
sin VÀ 


=1 


The other properties of G can easily be checked. 


An infinite set F of functions defined and continuous on [a,b] is said to be 
equicontinuous on [a, b| if, given any € > 0, there is a ô > 0, which depends 
only on €, such that 


x,€ € [a,b], je- él< => |f(a)—- f()| <e for all f € F. 


This condition clearly implies that every member of an equicontinuous set of 
functions is a uniformly continuous function on [a,b]; but, more than that, the 
same 6 works for all the functions in F. The set F is uniformly bounded if there 
is a positive number M such that 


\f(x)| <M for all f € F, x € [a,b]. 


According to the Ascoli-Arzela theorem [14], if F is an infinite, uniformly 
bounded, and equicontinuous set of functions on the bounded interval [a,b], 
then F contains a sequence (fn : n € N) which is uniformly convergent on 
[a,b]. The limit of fn, by Theorem 1.17, is necessarily a continuous function on 
[a,b]. 


Lemma 2.24 


Let T be the integral operator defined by Equation (2.41). The set of functions 
{Tu}, where u € C(|a,b]) and |u|] < 1, is uniformly bounded and equicontin- 
uous. 


Proof 


Green’s function G is continuous on the closed square [a,b] x [a,b], therefore 
|G(a, €)| is uniformly continuous and bounded by some positive constant M. 
From (2.41) and the CBS inequality, 


|Tu(a)| = (G(@, €), u(S))] < Mvb—alull, 
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hence the set {|Tu| : ||u|| < 1} is bounded above by Mb — a. Because G is 
uniformly continuous on the square [a,b] x [a,b], given any € > 0, there is a 
ô > 0 such that 


21,2 € [a,b], |eo — xı| < 6 > |G(x2, £) — G(a1,§)| <€ forall ¿€ [a,b]. 
If u is continuous on [a,b], then 


|z2 — zı] <6 => |Tu(x2) — Tu(x1)| < eVb—allull. 


Thus the functions {Tu : u € Cla, b], ||ul] < 1} are equicontinuous. 


The norm of the operator T, denoted by ||T||, is defined by 
||Tl| = sup{||Tul] : u € C(fa, b]), llul] = 1}. (2.42) 
From this it follows that, if u Æ 0, then 
[Pel] = [T (u/ huD] Mell < ITI ul. 


But because this inequality also holds when u = 0, it holds for all u € C (Ja, b]). 
It is also a simple exercise (see Exercise 2.33) to show that 


IT| = sup |(Tu,u)|. (2.43) 


lul=1 


Now we can state our first existence theorem for the eigenvalue problem 
under consideration. 


Theorem 2.25 
Either ||T|| or — ||T|| is an eigenvalue of T. 


Proof 
|Z] = sup{|(Tu,u)}| : u € C(Ja,b]), ||ul| = 1} and (Tu, u) is a real number, 
hence either ||T|| = sup (Tu, u) or ||T|| = —inf (Tu,u), ||ul| = 1. Suppose 


||7'|| = sup (Tu, u). Then there is a sequence of functions up E€ C([a,6]), with 
|ux|| = 1, such that 
(Tuk, Ur) > ||T'|| as k > oo. 


Since {Tux} is uniformly bounded and equicontinuous, we can use the Ascoli- 
Arzela theorem to conclude that it has a subsequence {Tux } which is uniformly 
convergent on [a,b] to a continuous function yy. We now prove that pọ is an 
eigenfunction of T corresponding to the eigenvalue fig = ||T'|| . 
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As j — œ we have 


sup |Tus (2) — yo()| > 0, 
x€ [a,b] 


which implies 
|Tuk; — Yo|| > 9, (2.44) 


and therefore || Der, | > ||%o||. Furthermore, because (Tux,, uk, ) > Ho; 
|| Pun, — Kovk; I" = ||P, I? + Ho = 2 Lo (Tuk; Uk} S? leol? = Hô» (2.45) 


hence ||poll? > 2 > 0, and the function y, cannot vanish identically on [a,b]. 
Because ||Tux, |? < IITII? |u; |? = 2, it also follows from (2.45) that 


0 < ||Tuk, — pour; ||" < 2u? — 2m0 (Tur,,ur,)- 
But (Tup,,Uk;) > Ho, hence 
|| Te; — uour; || > 0. (2.46) 
Now we can write 


0 < ||[T po — Ho¥oll 
< ||Teo — T(Tux,)|] + |T (Tur) — HoT Ue, || + [|MoT ux; — Ho¥ol] - 


In the limit as 7 — oo, using the inequality ||T'ul] < ||T|| |u|] together with 
(2.44) and (2.46), we conclude that ||T po — Hoyoll = 0. Tyo — Hopo being 
continuous, this implies Ty (x) = poyo(zx) for all x € [a,b]. 


If ||Z']| = — inf (Tu, u), a similar argument leads to the same conclusion. 
Let 
Po 
po = , 
° Iioll 


Gi (a, €) = G(x, £) -r UoWo(x) vo (8), 
b 


(Tiu)(x) = |] Gila, §)u(§)d& 


a 


= Tu(x) — uo (u, Yo) Vo(x) forall u € C(|a, b]). (2.47) 


The function G has the same regularity and symmetry properties as G, hence 
Lemma 2.24 and Theorem 2.25 clearly apply to the self-adjoint operator 7}. If 
|ITi|| 4 0 then we define 


sup{| (Tiu, u)| : u € C(fa,b]), llull = 1} = [el, 
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where u is a (nonzero) real number which, by Theorem 2.25, is an eigenvalue 
of T; corresponding to some (nonzero) eigenfunction y € C([(a, b]); that is, 


Tipi = H191- 
Let Yı = Y1/ l|pı|| Then, for any u € C([a,b]), 


(Tiu, Vo) = (Tu, Vo) = Ly ((u, Wo) Wo; Wo) 
a Tho) — Ho (u, Yo) 


since Tyo = foto. In particular, (Ti~,,%) = (11,0) = 0, and therefore 
p; is orthogonal to po. Now Equation (2.47) gives 


Ty, = Tiy = mt. 
Thus 7, is also an eigenfunction of T, and the associated eigenvalue satisfies 


al = [TY] < ITI = laol- 


Setting 


Go(x,€) = Gi(x,€) — mY (x) v1 (£) = G(x, $) — Po Ugh, (2)d,(E) 


Tou = Tru — py (u, %1) Yı = Tu — z Hp (U, Ve) Vr» 


and proceeding as above, we deduce the existence of a third normalized eigen- 
function Y» of T associated with a real eigenvalue us such that ps is orthogonal 
to both Yo and 4%; and |u| < |u|. Thus we obtain an orthonormal sequence of 
eigenfunctions Yo, Y1, Y2,- -- of T corresponding to the sequence of eigenvalues 


lio] > jual > |u| > .... The sequence of eigenfunctions terminates only if 
||Ta|| = 0 for some n. In that case 
n—1 n—-1 
0 = LT,u= LTu- 5 by (u, Yk) Ly, = u- Xe Hp (U, Ve) LY, 
k=0 k=0 


for all u € C({a, b]), which would yield 


n=l n—i1 
u= 2 Hr (u, Ve) LYE = `X (u, Yp) LT Yp = 5 (u, Vr) Vr- 
k=0 k=0 


But because no finite set of eigenfunctions can span C([a,b]), this last equality 
cannot hold for all u € C({[a, b]). Consequently, ||Z;,|| > 0 for all n € N, and we 
have proved the following. 
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Theorem 2.26 (Existence Theorem) 


The integral operator T defined by Equation (2.41) has an infinite sequence of 


eigenfunctions (~,,) which are orthonormal in £?(a, b). 


2.4.2 Completeness of the Eigenfunctions 


If f is any function in £?(a,b) then, by Bessel’s inequality (1.22), we have 


SoA ved? SIF? 


k=0 


To prove the completeness of the eigenfunctions (7,,) in £?(a,b) we have to 
show that this inequality is in fact an equality. This we do by first proving that 


f= So (F Yr) ve 


> 
Il 
© 


for any f € C?([a,b]) which satisfies the boundary conditions (2.35) and (2.36), 
and then using the density of C?({a,b]) in £?(a,b) to extend this equality to 
L? (a,b). 


Theorem 2.27 


Given any function f € C?([a, b]) which satisfies the boundary conditions (2.35) 
and (2.36), the infinite series X` (f, Yp) Yg converges uniformly to f on [a,b]. 


Proof 


For every fixed x € [a,b], we have 
(Gle, ), Ya) = TH) = ppid,(a) for all k €N. 


Bessel’s inequality, applied to G as a function of €, yields 
n b 
Sies f CEOP forarea, nen: 
k=0 j 
Integrating with respect to x and letting n — oo, we obtain 


doe S M?(b—a)?, (2.48) 
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where M is the maximum value of |G(x,&)| on the square [a,b] x [a,b]. An 
immediate consequence of the inequality (2.48) is that 
lim |u| = 0. (2.49) 
With 


Gr(x,€) = -5 Meh), (@ 


we have already seen that the integral operator 


b 
Ta iula) | Gule, EdE u€ Cla), 
has eigenvalue u„ and norm 


[Tall = [onl - 
Therefore, for any u € C([a,b]), 


[Taul] = 


uF mlu ubr) Vell <[enl (lull +0 asn—oo (2.50) 


in view of (2.49). If n > m, then 


ee ees ( a (usta) va) 2 


k=m =e 


but because |Tu| < MVb — a ||u]|| for all u € C([a,b]), it follows that 


5 Hp (U, Vk) Vk 


k=m 


< ITI [= Tine) 


i 1/2 
suva (y pa l 


k=m 


By Bessel’s inequality, the right-hand side of this inequality tends to zero as 
m, n — œ, hence the series 


oe Hp (u,b 
k=0 


converges uniformly on [a,b] to a continuous function. The continuity of Tu 
and (2.50) now imply 


r= ` lp (u, Ypk) Ygl) forall x € [a,b]. (2.51) 


k=0 
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If f € C?({a,b]) satisfies the boundary conditions (2.35) and (2.36), then 
u = Lf is a continuous function on [a,b] and f = Tu. Since 


Hk (u, Yk) = (u, Ux Vx) = (u, Tz) a (Tu, Px) = (f, Vk) ’ 
Equation (2.51) yields 


F) = 2 (F Vr) Yee) forall x € [a,b]. 


Ms 


it 
| 


0 


Now we need to prove that any function f in £?(a,b) can be approxi- 
mated (in the £? norm) by a C? function on [a,b] which satisfies the sepa- 
rated boundary conditions (2.35) and (2.36). In fact it suffices to prove the 
density of C?({a,b]) in £?(a,b) regardless of the boundary conditions; for 
if g € C?({a,b]) then we can form a uniformly bounded sequence of func- 
tions gn E€ C({a,b]) such that g,(z) = g(x) on [a + 1/n,b — 1/n] and 
gnla) = gi (a) = gn(b) = gi,(b) = 0. Such a sequence clearly satisfies the 
boundary conditions and yields 


Tim |g ~ gall = 0. 


If f € £L7(a,b), we know from Theorem 1.27 that, given any £ > 0, there is 
a function h, continuous on [a,b], such that 


lf — All < e. (2.52) 


Hence we have to prove that, for any h € C([a,b]), there is a function g € 
C?({a, b]) such that 
|k- g|| <e. (2.53) 


This can be deduced from the Weierstrass approximation theorem (see [1] or 
[14]), but here is a direct proof of this result. 
Define the positive C° function 


where 


oo 1 =T 
= — +—_]d 
S | 
so that f°. a(x)dx = 1. Then we define the sequence of functions 


An(x) = na(nz), neN, 
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which also lie in C°°(IR). Because @ vanishes on |a| > 1, each function an 
vanishes on |x| > 1/n, and satisfies 


J An(x)dx = / a(x)dx =1 for all n. 


—oo —Co 


Finally, we extend h as a continuous function from [a,b] to R by setting 


0, xr<a-l,x2>b4+1 
h(x) = h(a)(x —a+1), a-l<a<a 
h(b)(—a + b+ 1), b<a<b+l, 
and define the convolution of h and a, by 
(hrane) =f haanle -y)dy 


x+(1/n) 
= / hy)on(e — y)dy 


With a, E€ C™ the function h * a, is also a C% function on R. Furthermore, 


1/n 
|h(a) — (h * an)(a)| = if [h(a — y)an(y)dy — h(x)an(y)|dy 


—1/n 


< sup{|A(x — y) — h(2)| : x € [a,b], |y] < 1/n}. 


Because h is continuous on R, it is uniformly continuous on any finite real 
interval, and therefore, as n —> ov, 


|h(x) — (h * an) (x)| < sup{|h(a — y) — h(x)| : x € [a,b], |y] < 1/n} > 0. 


We have actually shown here that C%, which is a subset of C?, is dense in 
C([a, b]) in the supremum norm, which is more than we set out to do. Conse- 
quently, there is an integer N such that 


|h-hxay||<e foraln >N. 
By choosing the function g in (2.53) to be h * ay, we obtain 
If = gll Sf — All + [lh — gl] < 2e (2.54) 


for any f € £L7(a,b). Using the triangle and Bessel inequalities, we also have 


n 


<|If-all+ b- Sath 


k=0 


|- È ve + | 0- ryw 
k=0 k=0 


zař- al b- Et) (2.55) 
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In view of Theorem 2.27, we can make ||g — g-o (9, Wk?) Vell < € by choosing 
n large enough, say n > M. This, together with (2.54) and (2.55), now imply 


lr 5 (Fs Yk) Wel] < 5e for all n > max{ N, M}. 
k=O 


We have therefore proved the following. 
Theorem 2.28 (Completeness Theorem) 
If f € L?(a,b), then 


Tim l- E (fot) Ua = 0. (2.56) 


Equation (2.56) is equivalent to the identity 


OO 


F= 2 Vk) Dr 


k=0 


where the equality is in £?(a, b). It is also equivalent to Parseval’s relation 


WFP = SoA eel? 
k=0 


Each one these equations expresses the fact that the sequence (Y) of ortho- 
normal eigenfunctions of T forms a complete set in £7(a, b). 


Going back to the SL problem 
Lu + Au = 0, 


ayu(a) + agu'(a) = 0, 


B,u(b) + Bau (b) = 0, 


where u € C?([a, b]), we have shown that this system of equations is equivalent 
to the single integral equation 


Tu = pu = —u/d. 


The boundary conditions determine a unique solution to the differential equa- 
tion Lu + Au = 0, therefore each eigenvalue À of —L corresponds to a unique 
eigenfunction u. This is equivalent to saying that each eigenvalue u of T corre- 
sponds to a unique eigenfunction u. 

By (2.49), 1/|An| = |un| > 0, and it follows from Lemma 2.22 that An — oo 
as n — oo. Reintroducing the weight function p, we therefore arrive at the 
following fundamental theorem. 
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Theorem 2.29 


Assuming p',r,p E C([a,b]), and p,p > 0 on [a,b], the SL eigenvalue problem 
defined by Equations (2.34) to (2.36) has an infinite sequence of real eigenvalues 


ro < Ay < Ag <- 


such that A, — oo. To each eigenvalue An corresponds a single eigenfunction 
Pn, and the sequence of eigenfunctions (y,, : n € No) forms an orthogonal basis 
of £2(a,b). 


Remark 2.30 


1. If the separated boundary conditions (2.35) and (2.36) are replaced by the 
periodic conditions 
u(a) = u(b),  w'(a) = u’ (b), 


it is a simple matter to verify that Equation (2.26) is satisfied provided p(a) = 
p(b) (Exercise 2.26). The operator defined by (2.33) is then self-adjoint and its 
eigenvalues are bounded below by — max {|r(x)| : a < x < b} (see Equation 
(2.38)). The results obtained above remain valid, except that the uniqueness 
of the eigenfunction for each eigenvalue is not guaranteed. This is illustrated 
by Example 3.1 in the next chapter. For more information on the SL problem 
under periodic boundary conditions see [6]. 


2. It can also be shown that each eigenfunction y,, has exactly n zeros in the 
interval (a,b), and it is worth checking this claim against the examples which 
are discussed henceforth. A proof of this result may be found in [6]. 


3. The interested reader may wish to refer to [18] for an extensive up-to-date 
bibliography on the SL problem and its history since the early part of the 
nineteenth century. 


Example 2.31 


Find the eigenvalues and eigenfunctions of the equation 
u” + àu =0, 0<a<l, 


subject to one of the pairs of separated boundary conditions 
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Solution 


The differential operator is formally self-adjoint and the boundary conditions (i) 
and (ii) are homogeneous and separated, so both Theorems 2.14 and 2.29 apply. 
We have already seen in Example 2.16 that, under the boundary conditions (i), 
the equation u” + Au = 0 has only positive eigenvalues. Under the boundary 
conditions (ii) it is a simple matter to show that it has no negative eigenvalues. 
This is consistent with Remark 2.30 in as much as r(x) = 0. 

If A > 0, the general solution of the differential equation is 


u(x) = cycos Vx + co sin VAx. (2.57) 
Applying conditions (i), we obtain 


u(0) Hq = 0, 
nn? 


u(l) = casin VAL = 0 = An = Sy, neN. 


In accordance with Theorem 2.29 the eigenvalues An tend to oo, and the cor- 
responding eigenfunctions 


Un(x) = sin me 


l 


are orthogonal in £?(0,1), because, for all m Æ n, 


l l 
1 
J sin esn dz = F [cos(m n)a cos(m +4 naa dx = 0. 
Reet i 2 Jo i i 


From Theorem 2.29 we also conclude that the sequence of functions 


_ nt 
sin "2, neéeN, 


is complete in £7(0, 1). 
Now we turn to the boundary conditions (ii): if A = 0, the general solution 
of u” = 0 is 
u(x) = c£ + C2, 


and conditions (ii) imply cı = 0. u is therefore a constant, which we can take 
to be 1. Thus 
Xo = 0, Uo(x) =1 


is the first eigenvalue—eigenfunction pair. But if A > 0, the first of conditions 
(ii) applied to (2.57) gives 


u (0) = VAc2 = 0, 


hence c2 = 0. The condition at x = l yields 
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n?n? 
u' (1) = —c1 V Asin VAL = 0 > An = 7 


Un(x) = cos 2, neN. 


Thus the eigenvalues and eigenfunctions under conditions (ii) are 


nn? 


y= SS; Un(x) = cos eae n € No. 
4 l 
Again we can verify that the sequence (cos(nr/lx) : n € No) is orthogonal in 
£7(0,1), and we conclude from Theorem 2.29 that it also spans this space. 


According to Example 2.31, any function f € £7(0,1) can be represented 
by an infinite series of the form 


f(x) = > ancos oa (2.58) 

n=0 

or of the form - 
f(x) = Y basin ai (2.59) 

n=1 


the equality in each case being in £7(0,1). Note that these two representations 
of f(a) do not necessarily give the same value at every point in [0,1]. For 
example, at x = 0, we obtain f(0) = $> o an in the first representation and 


f(0) = 0 in the second. That is because Equations (2.58) and (2.59) are not 
pointwise but £7(0,/) equalities, and should therefore be interpreted to mean 


bo- >> ancos Ta =0, Fo- X basin m =0, 
n=0 n=0 
respectively. 
By Definition 1.29, the coefficients an and b, are determined by the formulas 
re (f, AA, A pa (f, ney , 
||cos(nza /1)|| ||sin(nza /1)|| 
Because 
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the coefficients in the expansions (2.58) and (2.59) are therefore given by 


l 

ao = if f(a)dz, (2.60) 
l 

An = ah f(x)cos he dx, (2.61) 
l 

bn = n f(a)sin a dx. (2.62) 


Consider, for example, the constant function defined on [0,/] by f(a) = 1. 
For every n € N, 


l 
(Lsin(ara/)) = f sin de = E a AN 


NT NT 


Therefore bn = (2/n7)(1 — (—1)") and we can write 


DEE EE ca eee Cea 
1=25°( : sin To 


n=1 


4f. J'a; 1 
= — |sing + -sin 3z + -sin 5z +- |. 
T 3 5 


As pointed out above, this equality should, of course, be understood to mean 


2 (1-(-1)F\ kr 
1 > i )sin =0, 
or 
9 (=>) R na 
2 sin — zx > 1 as n —> ov. 
T k l 
k=1 
On the other hand, since 
l 
(1, cos(nrx/l)} = f cos(nne tae = { : neN 
0 ; n= 
I1? = 2, 


the coefficients an in the expansion (2.58) all vanish except for the first, which is 
ao = 1, hence the series reduces to the single term 1. But this is to be expected 
because the function f coincides with the first element of the sequence 


2 
(cos ZZ NE No) = (1, cos ZE, cos =) , 


By the same token, based on Theorem 2.29, the sequence of eigenfunctions 
for the SL problem discussed in Example 2.21 is complete in Le, b), where 
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p(x) = 1/x. Hence any function f € £}(1,b) can be expanded in a series of the 


form 
= : nT 
5 bn sin (5 log z) ; 


n=1 


where 
(f, sin(nm log x/ log b)) , 
"— |Isin(n log x/log b)||> ` 


2.4.3 The Singular SL Problem 


We end this section with a brief word about the singular SL problem. In the 
equation 


Lu + Apu = (pu'Y + ru + Apu = 0, a<xx< b, 


where p is smooth and p is positive and continuous, we have so far assumed that 
p does not vanish on the closed and bounded interval [a,b]. Any relaxation of 
these conditions leads to a singular problem. In this book we consider singular 
SL problems which result from one or both of the following situations. 


1. p(x) =0 at x =a and/or z = b. 
2. The interval (a,b) is infinite. 


In the first instance the expression pp(f’g— fg’) vanishes at the endpoint where 
p = 0, hence no boundary condition is required at that endpoint. In particular, 
if p(a) = p(b) = 0 and the limits of u at a and b exist, then the equation 


ppl f'g — fg) È =0 (2.63) 


is satisfied and L is self-adjoint. If (a,b) is infinite, then it would be necessary 
for \/pu(a) to tend to 0 as |x| — oo in order for u to lie in £2}. The conclusions 
of Theorem 2.29 remain valid under these conditions, although we do not prove 
this. For a more extensive treatment of the subject, the reader is referred to [15] 
and [18]. The latter, by Anton Zettl, gives an up-to-date account of the history 
and development of the singular problem and some recent results of a more 
general nature than we have discussed in this book. In particular, using the 
more general methods of Lebesgue measure and integration, the smoothness 
conditions imposed on the coefficients in the SL equation may be replaced by 
much weaker integrability conditions. 
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In the chapters below we consider the following typical examples of singular 
SL equations. 


1. Legendre’s Equation 


(1 — 2?)u” — 2ru’ + n(n+1)u=0, -l<a«<l, 


where \ = n(n + 1) and the function p(x) = 1 — x? vanishes at the end- 


points x = +1, hence only the existence of lim u at x = +1 is required. 


2. Hermite’s Equation 


u” — Qeu' + 2nu = 0, ceER. 


x 


After multiplication by e7 af this equation is transformed to the standard 


self-adjoint form 
e? u" — 2xe-* u + Ine" u = 0, 


2 


and p(x) = e77. 


where \ = 2n, p(x) = e7 


3. Laguerre’s Equation 
gu” — (1 — xu’ + nu = 0, x > 0, 
which is transformed to the standard form 
xe “u" — (1— x)e *u' +ne *u=0 


by multiplication by e~”. Here p(x) = xe~* vanishes at x = 0. 


4. Bessel’s Equation 


2 
n 

cu’ +u — —u + àzu = 0, x >o0. 
x 


Here both functions p(x) = x and p(x) = x vanish at x = 0. 


The solutions of these equations provide important examples of the so-called 
special functions of mathematical physics, which we study in Chapters 4 and 5. 
But before that, in Chapter 3, we use the regular SL problem to develop the £? 
theory of Fourier series for the trigonometric functions. The singular problem 
allows us to generalize this theory to other orthogonal functions. 


90 2. The Sturm-—Liouville Theory 


EXERCISES 
2.23 Determine the eigenvalues and eigenfunctions of the boundary-value 
problem 
u” + \u=0, a<a<b, 
u(a) = u(b) = 


2.24 Determine the eigenvalues and eigenfunctions of the equation z?°u” — 
cu’ + àu = 0 on (1, e) under the boundary conditions u(1) = u(e) = 
0. Write the form of the orthogonality relation between the eigen- 


functions. 


2.2 


OU 


Verify that p(f’g — fi. = 0 if f and g satisfy the separated ho- 
mogeneous boundary conditions (2.35) and (2.36). 


2.26 Verify that p( f'g — fx. = 0 if f and g satisfy the periodic condi- 
tions u(a) = u(b), u’(a) = u'(b), provided p(a) = p(b). 


2.27 Which of the following boundary conditions make p(f’g — f gÈ = 


0? 
(a) ple) =1, a<z <b, u(a) = u(b), w(a) = u'(b). 

(b) p(z) =2, 0<a< x< b, u(a) = u’ (b) = 0. 

(c) p(x) =sing, 0< £< r/2, u(0)=1, u(m/2) =0. 

(d) p(x) =e-*, 0< a <1, u(0) = u(1), w(0) = u'(1) 

(e) p(z) = 22, 0<a <b, u(0) =u (b), u(b) = w (0) 

(£) p(x) =22, -1 <x <1, u(—1) = u(1), w(-1) = v (1) 


2.28 Which boundary conditions in Exercise 2.27 define a singular SL 
problem? 


2.29 Determine the eigenvalues and eigenfunctions of the problem 
[(e@+3yy!+Ay=0, -2<2<1, 
y(—2) = y(1) = 0. 


2.30 Let 
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(a) Show how the positive eigenvalues \ may be determined. Are 
there any eigenvalues in (—oo, 0]? 


(b) What are the corresponding eigenfunctions? 


2.31 Discuss the sequence of eigenvalues and eigenfunctions of the prob- 
lem 


2.32 Let 


(pu) + ru + àu = 0, a<a<b, 
u(a) = u(b) = 0. 


If p(x) > 0 and r(x) < c, prove that A > —c. 


2.33 Using Equation (2.42), prove that the norm of the operator T is also 
given by 
ITI = sup Kun) 
where u € C({(a, 6]). Hint: |(Lu,u)| < ||T|| by the CBS inequality 
and Equation (2.42). To prove the reverse inequality, first show that 
2Re (Tu, v) < (llull? + |lv||?) sup |(Tu, u)|, then set v = Tu/ ||Tull . 


This page intentionally left blank 


3 


Fourier Series 


This chapter deals with the theory and applications of Fourier series, named 
after Joseph Fourier (1768-1830), the French physicist who developed the series 
in his investigation of the transfer of heat. His results were later refined by 
others, especially the German mathematician Gustav Lejeune Dirichlet (1805- 
1859), who made important contributions to the convergence properties of the 
series. 

The first section presents the £? theory, which is based on the results 
of the regular SL problem as developed in the previous chapter. We saw in 
Example 2.31 that, based on Theorem 2.29, both sets of orthogonal functions 
{cos(nma/l) : n € No} and {sin(nza/l) : n € N} span the space £?(0,1). We 
now show that their union spans £7(—1,1), and thereby arrive at the funda- 
mental theorem of Fourier series. 

In Section 3.2 we consider the pointwise theory and prove the basic result 
in this connection, which is Theorem 3.9, using the properties of the Dirichlet 
kernel. The third and final section of this chapter is devoted to applications to 
boundary-value problems. 


3.1 Fourier Series in L? 


A function y : [-1,l] — C is said to be even if 
y(—x) = (x) for all x € [-, J], 


and odd if 
y(—x) = —y(x) for all x € [-I,]]. 
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The same definition applies if the compact interval [—1, 1] is replaced by (—1, l) 
or (—o0, 00). If ọ is integrable on [—I, l], then it follows that 


[ “nie af aoa 


A l A= 


if y is odd. Clearly, the sum of two even (odd) functions is even (odd), and the 
product of two even or odd functions is even, whereas the product of an even 
function and an odd function is odd. 

An arbitrary function f defined on [—I,/] can always be represented as a 
sum of the form 


if y is even, and 


N| = 


f(z) = (f(a) + f2) + SI) f(-2)], (3.1) 


where f(x) = $[f(x)+f(—2)] is an even function and f,(x) = $[f(x) — f(—2)] 
is an odd function. fe and fo are referred to as the even and the odd parts 
(or components) of f, respectively. To show that the representation (3.1) is 
unique, let 


f(a) = læ) + fola) = fez) + folz), (3.2) 
where fe is even and Í is odd. Replacing x by —x gives 
f(—2) = fe(x) = fo(x) = fe(x) — fo(z). (3.3) 


Adding and subtracting (3.2) and (3.3) we conclude that A = fe and fo =n for 

Any linear combination of even (odd) functions is even (odd), therefore the 
even and odd functions in £?(—1,/) clearly form two complementary subspaces 
which are orthogonal to each other, in the sense that if y,~ € L?(—I,1) with 
y even and w odd, then y and w are orthogonal to each other, for their inner 
product is 


l 
(w= | oade = 0 
because the function yy is odd. 


Let f = fe + fo be any function in £?(—1,1). Clearly both fe and fo belong 
to £2(—1,1). The restriction of fe to (0,1) can be represented in £? in terms of 
the (complete) orthogonal set {cos(n7a/l) : n € No} as 
nT 
— r, 


fe(x) = ao + 5 an cos — 


n=1 


(3.4) 
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where, according to the formulas (2.60) and (2.61), 


l 
= a felx)dz 


2 l 
= F fe(x)cos Mi dz, neN. 
To i 
The orthogonal set {sin(nra/l) : n € N} also spans £7(0,1), hence 
Jols) = Ñ bn sin ae, 0<2<l, (3.5) 
where ; 
2 
= a fo(x)sin oe dz, KEN. 
0 


Because both sides of (3.4) are even functions on {—1, l], the equality extends 
o [—1, l]; and similarly the two sides of equation (3.5) are odd and this allows 
us to extend the equality to [—1, l]. Hence we have the following representation 
of any f € £7(-I,1), 


= . NT 
f(a) = ao + > AnCOS Me + 2 bn sin a 


= ao + = (ancos 2 — +0, sin a) : -l<a2<il, (3.6) 


the equality being, of course, in £?(—1,/). On the other hand, using the prop- 
erties of even and odd functions, we have, for all n € No, 


l l 
/ fe(x)sin T? dz = / fo(x)cos J? dz = 0, 
zf —l 


l l l 
1 1 
i fe(x)cos 2T ede = 5J fe(x)cos 2T ede = 5J f(x)cos a dx, 
a i 2J i 2J i 
l l l 
1 1 
f fo(x)sin “adr = 5) fo(x)sin “2 dz = F f(a)sin PT r da. 
f i 2), i 2J, i 


Hence the coefficients a, and bn in the representation (3.6) are given by 
1 fr 
ao = z | Fed 
1 f! nT 
=a f(x)cos T7 dz, neéN, 
1 


l 
= al f(ax)sin a dx, neéN. 
-i 


This result is confirmed by the following example. 
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Example 3.1 
Find the eigenfunctions of the equation 
u” + u=0, —l<ax<l, 


which satisfy the periodic boundary conditions 


Solution 


This is an SL problem in which the boundary conditions are not separated but 
periodic. Because the operator —d?/dx? is self-adjoint under these conditions 
(see Exercise 2.26), the conclusions of Theorem 2.29 still apply (except for the 
uniqueness of the eigenfunctions), as pointed out in Remark 2.30, hence its 
eigenfunctions are orthogonal and complete in £2(—I, 1). If A < 0 it is straight- 
forward to check that the differential equation has only the trivial solution 
under the given periodic boundary conditions. If A = 0 the general solution is 
u(x) = c1@ + c2, and the boundary conditions yield the eigenfunction u(x) = 1. 
For positive values of A the solution of the equation is 


C1 COS VAT + csin VAT. 
Applying the boundary conditions to this solution leads to the pair of equations 
cosin VM = 0, 
ci vÀ sin VAL = 0. 


Because VX > 0 and cı and cz cannot both be 0, the eigenvalues must satisfy 
sin VA l = 0. Thus the eigenvalues in [0,00) are given by 


Dae jy; n € No, 


and the corresponding eigenfunctions are 
27 3T 3T 


T _ 0 i 

7% Singe, cos- 7%) CO8 T, sin T2, 

Note that every eigenvalue Àn = (n7/l)? is associated with two eigenfunctions, 
cos(nrz/l) and sin(nrz/l), except Ao = 0 which is associated with the single 
eigenfunction 1. This does not violate Theorem 2.29 which states that each 
eigenvalue corresponds to a single eigenfunction, for that part of the theorem 


1, cos x, sin 


only applies when the boundary conditions are separated. In the example at 
hand the boundary conditions yield the pair of equations 


eV Asin VAL = 0, csin V Al = 0, 
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which do not determine the constants cı and cp when A = (n7/1)?, so we can 
set cı = 0 to pick up the eigenfunction sin(nma/l) and c2 = 0 to pick up 
cos(nrz/l). 

We have therefore proved the following. 


Theorem 3.2 (Fundamental Theorem of Fourier Series) 


The orthogonal set of functions 


{1, cos Tr, sin T :n EN} 


is complete in £?(—1,1), in the sense that any function f € £?(—l,1) can be 
represented by the series 


nT 
= ao + 2 (ancos 2 — 2 + basin — i ar) ; =p sl, (3.7) 
where 
a ce) gee i 
“=p Ia oe 
te (f, cos(nma/l)) JE f(a)cos oe dz, neN, (3.9) 
cos(nrz/l) \| 
AR (f,sin(na/l)) =I f a f(x x)sin "2 dz, nEeN. (3.10) 
sin(nrz/)|? 


The right-hand side of Equation (3.7) is called the Fourier series expansion 
of f, and the coefficients an and bn in the expansion are the Fourier coefficients 


of f. 


Remark 3.3 


1. If f is an even function, the coefficients b, = 0 for all n € N and f is then 
represented on [—1,/] by a cosine series, 


Co 
nT 
x) = ao + > GnC08 5-2, 
n=l 


in which the coefficients are given by 


I 2 l 
= if f(x)dx, an = F f(x)cos ar dx, neN. 
0 0 
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Conversely, if f is odd, then a, = 0 for all n € No and f is represented by a 
sine series, 


L nn 
f(x) = 2 bnsin Th 


where E 
bn = al f(a)sin Fe dx, n EN. 
0 


2. The equality expressed by (3.7) between f and its Fourier series is not nec- 
essarily pointwise, but in £?(—I,/). It means that 


NT 


N 2 
Vo = feo FY (ancos x + bnsin Ta)]| | = 
n=1 l l 
2 2 N 2 2 
IAP = [oleo + $ (onl? + P| > 0 as N = o. 
n=1 


Because f € £°(—I,1), this convergence clearly implies that the positive series 
© |an|?and X |bn|? both converge and therefore lim an = limb, = 0. 

3. If the Fourier series of f converges uniformly on [—I,/], then, by Corollary 
1.19, its sum is continuous and the equality (3.7) is pointwise provided f is 
continuous on [—1, j]. 


Example 3.4 
The function 
-1, —™< “<0 
f(z)= 5 9, r= 
1, O<a<7, 
shown in Figure 3.1, 
Yi 
y= f(x) 
| oe 


Figure 3.1 


3.1 Fourier Series in £? 99 


is clearly in £2(—z,7). To obtain its Fourier series expansion, we first note 
that f is odd on (—7,7), hence its Fourier series reduces to a sine series. The 
Fourier coefficients are given by 


bn = a f(x)sin nx dx 


T 
2 T 

--| sin ng dx 
T Jo 
2 

ae 
zal cos n7) 


bn is therefore 4/na when n is odd, and 0 when n is even, and tends to 0 as 
n — oo, in agreement with Remark 3.3 above. Thus we have 


f(x) = bnsin nx 
n=1 
4. 4. 4, 
= — sinz +4 sin 3x 4 sin 5g +- 
T 3m 5r 
oe 1 
= 2 zy gq 22r + De. (3.11) 


Observe how the terms of the partial sums 


N 
4 1 f l 
Sy(a2) = > mi sin(2n + 1)x 


n=0 


add up in Figure 3.2 to approximate the graph of f. Note also that the Fourier 
series of f is 0 at +m whereas f(z) = 1 and f(—7) is not defined , which just 
says that Equation (3.11) does not hold at every point in [—7,, 7]. We shall have 
more to say about that in the next section. 


S(x) S(x) S,(x) 


Figure 3.2 The sequence of partial sums Sy. 


100 3. Fourier Series 


Theorem 3.2 applies to real as well as complex functions in £?(—1,1). When 
f is real there is a clear advantage to using the orthogonal sequence of real 
trigonometric functions 
T BTT 2T _ 2r 
x, sin =z, cos — z7, sin — g, --: 


l l l i 
to expand f, for then the Fourier coefficients are also real. Alternatively, we 


1, cos 


could use Euler’s relation 
e” = cosg + isin z, zER, 


to express the Fourier series in exponential form. To that end we define the 


coefficients 
1 
— = — d 
Co = ao z7 J to T, 
1 1 f! ; 
meres = ae —innre/l gy 
Cn 5 (An ibn) J [ foe T, 
1 : 1 : inne /1 
C-n = = (Gn T ibn) T f(x)e dz, ne N, 
2 2l Ja 
so that 
a nT nT 
n TS bn i ae ) 
ao + 2 (a cos i x + On Sin i x 


N 
= Co + ` [len + C_»)cos Ta + i(Cy — C-n )sin Ta] 


N N 
= co + > (a 4 gery = X cere, 
n=—N 


Because |c,| < |a@,| + |bn| < 2|cn]| for all n € Z, the two sides of this equation 
converge or diverge together as N increases. Thus, in the limit as N — oo, the 
right-hand side converges to the sum 


oo oo 
> cpe Ten = eects À (Cpe TEN Jepang OTE, 
n=1 


n=—Co 


and we obtain the exponential form of the Fourier series as follows. 


Corollary 3.5 


Any function f € £?(—I,1) can be represented by the Fourier series 


f(z) = 3 epe TEI (3.12) 


n=— oo 
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where ( ny i 
f, oes 1 innr 
a | inna/ijj? ~ 3] J foe "de, né Z. (3.13) 
e _ 


We saw in Example 1.12 that the set {e7 : n € Z} is orthogonal in 
L?(—1, T), hence {e'"**/! : n € Z} is orthogonal in £?(—1,1). We can also show 
that {e"7*/! : n € Z} forms a complete set of eigenfunctions of the SL problem 
discussed in Example 3.1, where each eigenvalue n?7?//? is associated with the 
pair of eigenfunctions e**"™*/!, and thereby arrive at the representation (3.12) 
directly. In this representation, the Fourier coefficients cn are defined by the 
single formula (3.13) instead of the three formulas for ao, an, and bn, which is 
a clear advantage. Using (3.12) and (3.13) to expand the function in Example 
3.4, we obtain 


1 7 ; 
Cn = z P f(aje de 
1 Eo h ae 
= — / e dg -J e=in2dz]| 
27 0 -T 
= — sinne dx 
in Jo 


1 
= — (1 — cosnr), 


int 
Pees T 

= 1 1 n In 

f(a) Ra (-1)"Je 

£ 2 Xo _ 1 feinz = eWi2nt)y 

iT 2n +1 
n=0 

Ae | 

= sin(2n + 1)z. 
Dear e 


EXERCISES 


3.1 Verify that the set {e’""*/! : n € Z} is orthogonal in £?(—I, 1). What 
is the corresponding orthonormal set? 


3.2 Is the series 


Io o, 
5 ti sin(2n + l)x 


uniformly convergent? Why? 


102 3. Fourier Series 


3.3 Determine the Fourier series for the function f defined on [—1, 1] by 
0, -l<a<0 
fe) ={ 


l, O<2<1. 


3.4 Expand the function f(x) = m — |x| in a Fourier series on [—7, 7], 
and prove that the series converges uniformly. 


3.5 Expand the function f(x) = x? + x in a Fourier series on [—2, 2]. Is 
the convergence uniform? 

3.6 If the series J` |an| converges prove that the sum X>} a,cosnax 
is finite for every x in the interval [—7z,7], where it represents a 
continuous function. 

3.7 Prove that the real trigonometric series $ (ancos nz + bnsin nx) con- 
verges in £?(—7, 7) if, and only if, the numerical series $` (a2 + b2) 
converges. 


3.2 Pointwise Convergence of Fourier Series 


We say that a function f : R — C is periodic in p, where p is a positive 
number, if 


f(£+p)= f(x) forall zeER, 
and p is then called a period of f. When f is periodic in p it is also periodic in 
any positive multiple of p, because 
f(x + np) = f(x + (n—1)p +p) 
= f(x + (n — 1)p) 


It also follows that 


f(x—np) = f(z -np +p) =---= f(z), 


hence the equality f(x + np) = f(x) is true for any n € Z. The functions sin x 
and cosa, for example, are periodic in 27, whereas sin(ax) and cos(ax) are 
periodic in 27/a, where a > 0. A constant function is periodic in any positive 
number. 

A function f which is periodic in p and integrable on (0, p] is clearly inte- 
grable over any finite interval and, furthermore, its integral has the same value 
over all intervals of length p. In other words 
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x+p p 
j f(t)dt = f(tj)dt forallaz eR. (3.14) 
x 0 


This may be proved by using the periodicity of f to show that f f(t)dt = 
L ioa (Exercise 3.8). 
When the trigonometric series 


Sin(£) = ao + D> (akcos kx + bysin kz) 
k=1 


converges on R, its sum 
Co 
S(x) = ao + D> (axcos ka + bysin kz) 
k=1 


is clearly a periodic function in 27, for 27 is a common period of all its terms. 
If ay, and by, are the Fourier coefficients of some L?(—7, 7) function f, that is, if 


1 T 
a0 = 5 f(x)dz, 
27 E 
1 T 
an =—] f(x)cosna dz, 
TJ ax 
1 [7 A 
b, = = f(x)sin nx dz, neéN, 
T Jin 


then we know that S,, converges to f in £L?(—7,7), 
2 
Sn & f. 


We now wish to investigate the conditions under which Sn converges point- 
wise to f on [—7,7]. More specifically, if f is extended from [—7,7] to Rasa 
periodic function by the equation 


f(a+2nr)= f(x) forallxzeR, 


when does the equality f(z) = S(x) hold at every x € R? To answer this 
question we first need to introduce some definitions. 


Definition 3.6 


1. A function f defined on a bounded interval I, where (a,b) C I C [a,b], is 
said to be piecewise continuous if 
(i) f is continuous on (a, b) except for a finite number of points {21, 72,...,¢n}, 
(ii) The right-hand and left-hand limits 

lim, f (xi) = f(z}), lim f(x) = f(z; ) 


Loa" Ln 
i i 
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L Ni 1 l 


a 6, E 6, b 


Figure 3.3 A piecewise smooth function. 


=y 


exist at every point of discontinuity x;, and 
(iii) lim,_.2+ f(x) = f(a") and limp- f(x) = f(b~) exist at the endpoints. 
2. f is piecewise smooth if f and f’ are both piecewise continuous. 


3. If the interval I is unbounded, then f is piecewise continuous (smooth) if it 
is piecewise continuous (smooth) on every bounded subinterval of I. 


It is clear from the definition that a piecewise continuous function f on a 
bounded interval is a bounded function whose discontinuities are the result of 
finite “jumps” in its values. Its derivative f’ is not defined at the points where f 
is discontinuous. f’ is also not defined where it has a jump discontinuity (where 
the graph of f has a sharp “corner”, as shown in Figure 3.3). Consequently, if 
f’ is piecewise continuous on (a,b), there is a finite sequence of points (which 
includes the points of discontinuity of f) 


a < E1 < 2 < E3 ORE < b, 


where f is not differentiable, but f’ is continuous on each open subinterval 


(a, £1), (€5,€:41); and (Em: b), 1 < a < m — 1. 
Note that f’(a*) are the left-hand and right-hand limits of f’ at x, given by 


“)—fla-h 

f'(a )= lim f'(x) lim, f(x ) a ) 
= 

f(a") = lim f'(x) = lim, LE £C ) 


which are quite different from the left-hand and right-hand derivatives of f at 
x. The latter are determined by the limits 


fn feth- FC) 
h—0= h 


2 
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which may not exist at €; even if f is piecewise smooth. Conversely, not every 
differentiable function is piecewise smooth (see Exercises 3.10 and 3.11). A con- 
tinuous function is obviously piecewise continuous, but it may not be piecewise 
smooth, such as 
f(æ)=vr, O<e<l, 

in as much as lim, _.9+ f'(x) does not exist. 

The next two lemmas are used to prove Theorem 3.9, which is the main 
result of this section. 


Lemma 3.7 


If f is a piecewise continuous function on [—7, 7], then 


T T T 
lim f(x)cosnadx = lim f(x)sinnadx = lim f(x)" de = 0. 
noo = noo zg noo -r 
Proof 
Suppose £1,..., £p are the points of discontinuity of f on (—r, mr) arranged in 
increasing order. Because |f| is continuous and bounded on (£p, #41) for every 
0 < k < p, where zo = —7 and rp41 = 7, it is square integrable on all such 


intervals and 


f roas fT de f O as 


T T Tp 


Consequently f belongs to L?(—r, r). Its Fourier coefficients an, bn, and cn 
therefore tend to 0 as n — œœ (see Remarks 1.31 and 3.3). 


Lemma 3.8 
For any real number a 4 2mr, m E Z, 
1 sin(n + $)a 


5 + cos a + cos 2a +---+ cos na = 


3.15 
2sin la ( ) 


Proof 


Using the exponential expression for the cosine function, the left-hand side of 
(3.15) can be written in the form 
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1 2n 
= _e ina X eika 
2 
k=0 
2n 


= soe be Cae 


k=0 
Because e’* = 1 if, and only if, œ is an integral multiple of 27, 
2n ïa (erent 


int ia 1 —ina 
€ ye ie = 9° T= eta 
k=0 


N| = 


1 eina rz, ei(ntl)a 


= 7 ; a#0,+27,.... 


2 1 — ee 


Multiplying the numerator and denominator of this last expression by e 


we obtain the right-hand side of (3.15). 


The expression 


n 


1 PRE een 
Dr(a)= 5 doe re a 


k=—-n 
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(3.16) 


—ia/2 
> 


known as the Dirichlet kernel, is a continuous function of @ which is even and 
periodic in 27 (Figure 3.4). Based on Lemma 3.8, the Dirichlet kernel is also 


represented for all real values of a by 


sin(n + 3)a 


<>; a#0,27,... 
D,(a) = 27 sin 5a 
2n+1 0.2 
= Ty see 
on ? Q 7 > 


Figure 3.4 The Dirichlet kernel D4. 
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Integrating D,(a) on 0 < a < r, we obtain 


$ 1 ("fl n 1 
J Dn(a)da = if (5 + J cos ka) da=-—- foralneN. (3.17) 
0 TJo \2 k 2 


Now we use Lemmas 3.7 and 3.8 to prove the following pointwise version of 
the fundamental theorem of Fourier series. 


Theorem 3.9 
Let f be a piecewise smooth function on [—7, 7] which is periodic in 27. If 
1 T 
a9 = 57 f(x)dx, 
2T] 
1 T 
an == | f(x)cosne dz, 
TJ-r 
1 [* ; 
bn ==] f(a)sinna dz, neN, 
TJ ix 


then the Fourier series 


Co 
S(x) = ao + J` (ancos na + bnsin nz), 
k=1 


converges at every x in R to $[f(xt) + f(x7)}. 


Proof 


The nth partial sum of the Fourier series is 


Sn(Z) = ao + X (axcos kx + bnsin kx). 
k=1 


n 


Substituting the integral representations of the coefficients into this finite sum, 
and interchanging the order of integration and summation, we obtain 


saif HO $ 3 


-1 f #@[5+ 


By the definition of D, we can therefore write 


(cos k€ cos kx + sin k€ sin kx)| dE 


> 
Il 


Ma 


1 


Ma 


cos k(€ — 2) dé. 


> 
Il 
= 
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As a functions of t, both f(x +t) and D,(t) are periodic in 27, so we can use 
the relation (3.14) to write 


T 


Sala) = | F(a +i)Dnlt)dt. 


On the other hand, in view of (3.17) and the fact that D,, is an even function, 


1 0 TT 
sitet) + fe =F) | Dalat + fet) | Daledt 
=T 0 


Hence 


1 0 
Sla) - zeH + Fa) = f [f(@ +t) — f(x )]Dn(t)dt 


+f Veri- f(w*) Da (tat, 
0 


and we have to show that this sequence tends to 0 as n — oo. 
Define the function 


f(a+t) — f(a) O 
g(t) = Cn TN 

f@+O)-f0) plan 

ett —] i 


Using (3.16) we can write 


Sna) — EEH E= f Oe- Da (oat 


1 7 i , 

a t i(n+1)t _ „—int dt 

FPO ae 
and it remains to show that the right-hand side tends to 0 as n — oo. Because 
f is piecewise smooth on [—7,7], the function g is also piecewise smooth on 
[—7, 7] except possibly at t = 0. At t = 0 we can use D’Hôpital’s rule to obtain 
; ; '(x+t . + 
lim g(t) = lim uaa) = ~i f'(x”). 


t—0+ t0+ te’ 


Consequently g is piecewise continuous on [—7, 7m], its Fourier coefficients con- 
verge to 0 by Lemma 3.7, and therefore 


1 [* 
| g(t)e™" dt = 0 asn— oo. 


TT 
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Remark 3.10 


1. If f is continuous on [—7, 7] then $[f(a*) + f(a~)] = f(x) for every x € R, 
so that the Fourier series converges pointwise to f on R. Otherwise, where f 
is discontinuous at x, its Fourier series converges to the median of the “jump” 
at x, which is $[f(a*) + f(x7)], regardless of how f(x) is defined. This is 
consistent with the observation that, because the Fourier series S$ is completely 
determined by its coefficients a, and bn, and because these coefficients are 
defined by integrals involving f, the coefficients would not be sensitive to a 
change in the value of f(x) at isolated points. If f is defined at its points of 


discontinuity by the formula 
1 = 
Fle) = Se) + F7), 


then we would again have the pointwise equality S(x) = f(x) on the whole 
of R. 


2. Using the exponential form of the Fourier series, we have, for any function 
f which satisfies the hypothesis of the theorem, 


Z l 1 
im 5 Che = sl f(2*) +f(x-)] forall z eR, 


k=—-n 


where Lt 
Cn = = | f(z) "de. 
2T Ja 


3. Theorem 3.9 gives sufficient conditions for the pointwise convergence of the 
Fourier series of f to 4[f (x+) + f(<7)]. For an example of a function which is 
not piecewise smooth, but which can still be represented by a Fourier series, 
see page 91 [16], or Exercise 3.26. 


As one would expect, the interval [—7,7] may be replaced by any other 
interval which is symmetric with respect to the origin. 


Corollary 3.11 


Let f be a piecewise smooth function on [—1,/] which is periodic in 21. If 


1 l 
a= af fode 
1 l 
An = if f(x)cos oe dx, 
=l 


1/7’ 
m= Tf f(x)sin a dx, neN, 
=l 
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then the Fourier series 


lo) 
ao + Jo (ancos 8 + basin a) 


n=1 


converges at every x in R to $[f(xt) + f(<7)]. 


Proof 


Setting g(x) = f(la/m) we see that g(a + 27) = g(x), and that g satisfies the 
conditions of Theorem 3.9. 


In Example 3.4 the function f is piecewise continuous on (—7, 7], hence its 
periodic extension to R satisfies the conditions of Theorem 3.9. Note that f is 
discontinuous at all integral multiples of 7, where it has a jump discontinuity 
of magnitude |f(a*) — f(x;)| = 2. At z = 0 we have 


1 ate 1 
SUOI = 51-1] =9, 


which agrees with the value of its Fourier series 


Co 


4 1 : i 
S(x) = = 5 amri sin(2n + 1)x 


n=0 


at « = 0. Because f(0) was defined to be 0, the Fourier series converges to 
f at this point and, by periodicity, at all other even integral multiples of r. 
The same cannot be said of the point x = 7, where f(t) = 1 4 S(r) =0. By 
periodicity the Fourier series does not converge to f at x = +7,+37,... . To 


obtain convergence at these points (and hence at all points in R), we would 


have to redefine f at m to be 
fn) = Zia) + Fa) <0. 


In this same example, because f is continuous at x = 7/2, we have f(1/2) = 
S(a/2) = 1. Hence 


Co 


4 1 z i T 
I= 2 a en H 1)5 


n=0 
4a 1 

on —1)” 
DD DET i; 


which yields the following series representation of 7 : 


II 
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1 1 1 
S Apert 3.18 
i ( M ae ) (518) 
Given any real function f, we use the symbol f* to denote the positive part 


of f, that is, 
_f f(x) if f(a) 20 
paed 0 if f(x) <0 
ORON 
Similarly, the negative part of f is 


ao E oo 


and we clearly have f = f+ — f7. 


Example 3.12 


The function f(t) = (2cos 1007t)*, shown graphically in Figure 3.5, describes 
the current flow, as a function of time t, through an electric semiconductor, also 
known as a half-wave rectifier. The current amplitude is 2 and its frequency 
is 50. 

To expand f, which is clearly piecewise smooth, in a Fourier series we first 
have to determine its period. This can be done by setting m/l = 1007, from 
which we conclude that | = 1/100. Because f is an even function, bn = 0 for 
all n € N and ay, are given by 


1 l 
o= af feat 


l 
[flee 


1/200 
= 100 / 2 cos 100ztdt 
0 


= 2/n, 


fO 


9 


l 


— 0.01 0.01 0.02 0.03 t 


Figure 3.5 Rectified wave. 
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2 A 
an= =J f(t)cos Midt 
L Jo l 
1/200 
= 200 f 2 cos 1007t cos 100n7t dt 
0 
1/200 
Z 200 f [cos(n + 1)100rt + cos(n — 1)100rt]dt, neN. 
0 
When n= 1, 


1/200 
a, = 200 f (cos 2007t + 1)dt = 1. 
0 


For all n > 2, we have 


9 1 1/200 
an = sin(n + 1)1007¢ + sin(n — 1)1007t 
m|n+1 n—1 0 
2 1 1 
at = i sin(n + 1) > = 7 sin(n D3] 
_ 2 1 } 
ap kntl n=l ree 
— 4 OS 
ie cos ns. 
Thus 
4 4 
a2 = yz» a3 = 0, a4 = — Te 5 = 0). fags 
Because f is a continuous function we finally obtain 
2 4 (1 1 
f(t) = — + cos 100at + — | = cos 200rt — — cos 400at + --- 
T TAS 15 
2 4 & (1) 
= z t Cos 1007t — = > mz °° 200nzt for allt € R. 


The continuity of f is consistent with the fact that the series converges uni- 
formly (by the M-test). The converse is not true in general, for a convergent 
series can have a continuous sum without the convergence being uniform, as 
in the pointwise convergence of X x” to 1/(1 — x) on (—1,1). But this cannot 
happen with a Fourier series, as we now show. 


Theorem 3.13 


If f is a continuous function on the interval [—7,7] such that f(—7) = f(r) 
and f’ is piecewise continuous on (—7, mT), then the series 


5 y lanl” + |r|? 
n=1 
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is convergent, where a, and b, are the Fourier coefficients of f defined by 


bf? 1 
Qn = = f(x)cos nz dz, bn = — f(x)sin na dx. 
-r T Jax 


Observe that the conditions imposed on f in this theorem are the same 
as those of Theorem 3.9 with piecewise continuity replaced by continuity on 
[—7, r]. 


Proof 


Because f’ is piecewise continuous on [—7, 7], it belongs to £?(—7, 7) and its 
Fourier coefficients 


1 T 
a = on f'(x)dz, 
1 T 
a = = f'(x)cosnax dz, 
1 T 
bv, =-=] f'(x)sinnz dz, neéeN, 
T 


-p 


exist. Integrating directly, or by parts, and using the relation f(—7) = f(r), 
we obtain 


1 
a = If) — F(—m)] =0, 
a, = —f(a)cosnz}| + 2 f(x)sinng dz = nbn, 
a Se 
1 1 ‘ í n f7 
bn = —f(x)sin ng e f(x)cosnx dx = —nay. 
T En = 


Consequently, 
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where we use the CBS inequality in the last relation. By Bessel’s inequality 
(1.22) we have 


N T 
2 lanl? A $) < a IF’ (x)|? da < o. 


The series X` 1/n? converges, and thus the increasing sequence Sy is bounded 
above and is therefore convergent. 


Corollary 3.14 
If f satisfies the conditions of Theorem 3.13, then the convergence of the Fourier 
series 

ao + >> (ancos na + bnsin nz) (3.19) 


n=1 


to f on [—7, 7] is uniform and absolute. 


Proof 


The extension of f from [—7, 7] to R by the periodicity relation 
f(a@+2r)=f(x) forallxeR 


is a continuous function which satisfies the conditions of Theorem 3.9, hence 
the Fourier series (3.19) converges to f(x) at every x € R. But 


lancos nz + bysinna| < jan| + [bn] < 24 lan]? + bnl’, 


and because 5 4/ |an|? + |bn|? converges, the series (3.19) converges uniformly 
and absolutely by the M-test. 


Based on Corollaries 1.19 and 3.14, we can now assert that a function which 
meets the conditions of Theorem 3.9 is continuous if, and only if, its Fourier 
series is uniformly convergent on R. Needless to say, this result remains valid 
if the period 27 is replaced by any other period 21. 


Corollary 3.15 


If f is piecewise smooth on [—1,/] and periodic in 21, its Fourier series is uni- 
formly convergent if, and only if, f is continuous. 
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EXERCISES 


3.8 Prove Equation (3.14). 


3.9 Determine which of the following functions are piecewise continuous 
and which are piecewise smooth. 


(a) f(x) = Ya, ceER. 

(b) f(z) =|sinz|, z ER. 

(c) f(z) = yz, 0< x£ <1, f(a+1) = f(x) for all z >0. 
(d) f(x) = |r, -1 <2 <1, f(z +2) = f(a) forall x € R. 
(e) f(z) = [e] =n for all z € [n,n +1), n €Z. 


3.10 Show that the function f(x) = |a| is piecewise smooth on R, but 
not differentiable at x = 0. Give an example of a function which is 
piecewise smooth on R but not differentiable at any integer. 


3.11 Show that the function 


fa) = { EA ae 


is differentiable, but not piecewise smooth, on R. 


3.12 Assuming f is a piecewise smooth function on (—1,1) which is peri- 
odic in 2l, show that f is piecewise smooth on R. 


3.13 Suppose that the functions f and g are piecewise smooth on the 
interval J. Prove that their sum f + g and product f- g are also 
piecewise smooth on J. What can be said about the quotient f/g? 


3.14 Determine the zeros of the Dirichlet kernel D,, and its maximum 
value. 


3.15 Verify that each of the following functions is piecewise smooth and 
determine its Fourier series. 


a) f(x) =x, -r <x<r, f(x +2r)= f(x) forall z ER. 
b) f(x) = |z|, -l<a<1, f(x +2) = f(x) forall z € R. 


d) f(x) = cos? x, x € R 
e) f(z) =e", —2 <x <2, f(x +4) = f(x) for alla € R. 
f) f(x) =x, -l<a<l, f(x + 2l) = f(x) for all z € R. 
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3.16 


3.17 


3.18 


3.19 


3.20 


3.21 


3.22 


3.23 


3.24 


3. Fourier Series 


In Exercise 3.15 determine where the convergence of the Fourier 
series is uniform. 


Determine the sum S(x) of the Fourier series at x = +2 in Exercise 
3.15(e) and at x = +l in Exercise 3.15(f). 


Use the Fourier series expansion of f(x) = x, =r <x <m, f(x + 
27) = f(x) on R, to obtain Equation (3.18). 


Use the Fourier series expansion of f(x) = |x|, =n <x <r, f(x + 
27) = f(x) on R, to obtain a representation of 7°. 


Use the Fourier series expansion of f in Example 3.12 to obtain a 
representation of 7. 


Determine the Fourier series expansion of f(x) = x? on [—7, 7], then 
use the result to evaluate each of the following series. 


Chie 


oo 1 
CODE Gao? 


Expand the function f(x) = |sinz| in a Fourier series on R. Verify 
that the series is uniformly convergent, and use the result to evaluate 
the series Xp; (4n? — 1)7* and Xp (—1)° tt (4n? — 1)71. 


Show that the function f(x) = |sin z| has a piecewise smooth deriv- 
ative. Expand f’ in a Fourier series and determine the value of the 
series at x = nr and at x = 77/24 nr, n € Z. Sketch f and f’. 


Suppose f is a piecewise smooth function on [0,1]. The even periodic 
extension of f is the function fe defined on R by 


x f(x), O<a<l 
f(-x), —l<x<0, 
F(z +21) = f(z), z € R; 
and the odd periodic extension of f is 
z _ J f(x), 0<x<l 
jute) = { —f(-2), —l <x <0, 


f(a+2l) = f(x), £1 E€ R. 


3.3 Boundary-Value Problems 117 


(a) Show that, if f is continuous on [0,1], then fe is continuous on 
R, but that fə is continuous if, and only if, f(0) = f(/) = 0. 


(b) Obtain the Fourier series expansions of fe and fo. 


(c) Given f(x) = x on [0,1], determine the Fourier series of fe and 
fo- 


3.25 For each of the following functions f : R — R, determine the value 
of the Fourier series at x = 0 and x = a, and compare the result 
with f(0) and f(z): 


(a) Even periodic extension of sin x from [0,7] to R. 
(b) Odd periodic extension of cos x from [0,7] to R. 
(c) Odd periodic extension of e” from [0,7] to R. 


3.26 Show that the Fourier series of f(x) = xt/3 converges at every 
point in [—7, 7]. Because f is continuous and lies in £2(—7, 7), con- 
clude that its Fourier series converges pointwise to f(x) at every 
x E€ (—7,7), although f is not piecewise smooth on (—7, 7). 


3.3 Boundary- Value Problems 


Fourier series play a crucial role in the construction of solutions to boundary- 
value and initial-value problems for partial differential equations. A solution 
of a partial differential equation will naturally be a function of more than one 
variable. When the equation is linear and homogeneous, an effective method 
for obtaining its solution is by separation of variables. This is based on the 
assumption that a solution, say u(x, y), can be expressed as a product of a 
function of x and a function of y, 


u(x, y) = v(x)w(y). (3.20) 


After substituting into the partial differential equation, if we can arrange the 
terms in the resulting equation so that one side involves only the variable x 
and the other only y, then each side must be a constant. This gives rise to two 
ordinary differential equations for the functions v and w which are, presumably, 
easier to solve than the original equation. The loss of generality involved in the 
initial assumption (3.20) is compensated for by taking linear combinations of all 
such solutions v(x)w(y) over the permissible values of the parametric constant. 
Two well-known examples of boundary-value problems which describe physical 
phenomena are now given and solved using separation of variables. 
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3.3.1 The Heat Equation 


Consider the heat equation in one space dimension 


du _ ou 
Ot Ox?’ 


which governs the heat flow along a thin bar of length l, where u(x,t) is the 
temperature of the bar at point x and time t, and k is a positive physical 
constant which depends on the material of the bar. To determine u we need to 
know the boundary conditions at the ends of the bar and the initial distribution 
of temperature along the bar. Suppose the boundary conditions on u are simply 


0<a<il,t>0, (3.21) 


u(0,t) = u(l,t) = 0, t > 0; (3.22) 


that is, the bar ends are held at 0 temperature, and the initial temperature 
distribution along the bar is 


u(x,0) = f(x), 0<zr<l, (3.23) 
for some given function f. We wish to determine u(x,t) for all points (x,t) in 
the strip (0,1) x (0,00). 

Let 
u(x,t) = v(x) w(t). 
Substituting into (3.21), we obtain 


which, after dividing by kuw, becomes 


v(x) w(t) 
v(a) kw) (3.2) 


Equation (3.24) cannot hold over the strip (0,1) x (0,00) unless each side is 
a constant, say —\*. The reason for choosing the constant to be —\* instead 
of A is clarified in Remark 3.16. Thus (3.24) breaks down into two ordinary 
differential equations, 
v” + dv =0, (3.25) 
w + kw =0. 
The solutions of (3.25) and (3.26), are 


u(x) = acos Az + bsin Az, (3.27) 
w(t) = ce kt (3.28) 
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where a, b, and c are constants of integration. Because these solutions depend 
on the parameter A, we indicate this dependence by writing v, and w). Thus 
the solutions we seek have the form 


u(x,t) = (a,cos Ax + bysin Ar)e kt, (3.29) 


where a) and b) are arbitrary constants which also depend on A. 
The first boundary condition in (3.22) implies 


uy(0, t) = ape Kt =0 for allt >0, 
from which we conclude that a, = 0. The second boundary condition gives 
uy(I,t) = basin Al ew” ** for all t > 0, 


hence bysin Al = 0. Because we cannot allow b) to be 0, or we get the trivial 
solution u, = 0 which cannot satisfy the initial condition (unless f = 0), we 
conclude that 


sin Al = 0, 


and therefore Al is an integral multiple of m. Thus the parameter À can only 
assume the discrete values 


An => neN. 


Notice that we have retained only the positive values of n, because the negative 
values have the effect of changing the sign of sin Ang, and can therefore be 
accommodated by the constants b,, = bn. The case n = 0 yields the trivial 
solution. Thus we arrive at the sequence of solutions to the heat equation, all 
satisfying the boundary conditions (3.22), 


—(nx/1)? kt 


Un(x,t) = bn sin Fare neéN. 


Equations (3.21) and (3.22) are linear and homogeneous, therefore the formal 
sum 


Meg 


u(x,t) = un(z, t) = 5 bnsin a eee (3.30) 
n=1 


n=1 


defines a more general solution of these equations (by the superposition prin- 
ciple). 
Now we apply the initial condition (3.23) to the expression (3.30): 


u(x, 0) = 5 bn sin 2 = f(x). 
n=1 
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Assuming that the function f is piecewise smooth on [0,/], its odd extension 
into [—l, l] satisfies the conditions of Corollary 3.11 in the interval [—1,/], and 
the coefficients b, must therefore be the Fourier coefficients of f; that is, 


I 
bn = al f(x)sin any dx. 
TJ i 


Here we are tacitly assuming that f(x) = 4[f(æ*) + f(x7)] for all x. This 
completely determines u(x,t), represented by the series (3.30), as the solution 
of the system of Equations (3.21) through (3.23) in [0,2] x [0, co). 


Remark 3.16 


1. Although the assumption u(x,t) = v(x)w(t) at the outset places a restriction 
on the form of the solutions (3.29) that we obtain, the linear combination (3.30) 
of such solutions restores the generality that was lost, in as much as the sequence 
sin(nrz/l) spans £(0, 1). 

2. The reason we chose the constant v(x) /v(x) = w'(t)/kw(t) to be negative 
is that a positive constant would change the solutions of Equation (3.25) from 
trigonometric to hyperbolic functions, and these cannot be used to expand f(x) 
(unless A is imaginary). The square on was introduced merely for the sake of 
convenience, in order to avoid using the square root sign in the representation 
of the solutions. 


In higher space dimensions, the homogeneous heat equation takes the form 
us = kAu, (3.31) 


where A is the Laplacian operator in space variables. In R” the Laplacian is 
given in Cartesian coordinates by 
3? oF ə? 


=t ee 
ðr? 0x8 Ox? 


Example 3.17 


The dynamic (i.e., time-dependent) temperature distribution on a rectangular 
conducting sheet of length a and width b, whose boundary is held at 0 tem- 
perature, is described by the following boundary-value problem for the heat 
equation in R?. 


Ut = k(Uzx T Uyy), (x, y) € (0, a) x (0, ), t > 0, 
u(0,y,t) = u(a,y,t)=0, ye (0,d), t>0, 
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u(x,0,t) = u(x, b,t) = 0, x € (0,a), t>0, 
u(z,y,0) = f(x,y), (x,y) € (0,a) x (0,b), 
where we use the more compact notation 


Ou Ou O7u 3u Ou 
Up a Us = gy’ Uss = 52? “ey Buda’ “yy = Gyz’ 


The assumption that 
u(x, y, t) = v(x, y)w(t) 


leads to the equation v(x, y)w’(t) = kw(t)[uzs(£, y) + Vvyy(x, y)]. After dividing 
by vw and noting that the left-hand side of the resulting equation depends 
on ¢ and the right-hand side depends on (x,y), we again assume a separation 
constant —À° and obtain the pair of equations 


w + kw = 0, 
Ura + Vyy + Mv = 0. 


Equation (3.32), as is (3.26), is solved by the exponential function (3.28). We 
can use separation of variables again to solve (3.33). Substituting v(x, y) = 
X(x)Y (y) into (3.33) leads to 


x" (æ) Y"), 


T T Me = 0, 
X(x)  Y(y) 
where the variables are separable, 
LE O pp 
X(x) Y (y) 


and —u? is the second separation constant. The resulting pair of equations has 
the general solutions 


X(x) = Acos pa + Bsin px, 


Y (y) = A’ cos Ap Ae = u?y + B’ sin / X? — py. 


The boundary conditions at x = 0 and y = 0 imply A = A’ = 0. From the 
condition at x = a we obtain u = yu, = na/a for all positive integers n, and 
from the condition at y = b we conclude that 


and therefore 
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Thus we arrive at the double sequence of functions 


Umn(2, Y, t) = bmne Ramen sin PT t sin Ey, m,n E€ N, 
a 
which satisfy the heat equation and the boundary conditions. 
Before applying the initial condition we form the formal double sum 


OO 
ulz, y, t) = 5 bmne tnnt sin x sin aT (3.34) 
mn=1 
which satisfies the boundary conditions on the sides of the rectangular sheet. 
Now the condition at t = 0 implies 


MT 


; (3.35) 


foe) 
nT 
Y= bmn Sin —2 si 
f(x,y) 5 mn sil a in 


m,n=1 


Assuming f is square integrable over the rectangle (0, a) x (0, b), we first extend 
f as an odd function of x and y into (—a, a) x (—b, b). Then we multiply Equation 
(3.35) by sin(n’x/a)x and integrate over (—a,a) to obtain 


ee) 2 a 1 
5 bmn’ sin Ly = 2f f(x, y)sin 2T rde. 
net b 0 a 


o NT 
sin — zx 
a 


Multiplying by sin(m’z/b)y and integrating over (—b, b) yields 


n'r |? m'r ||? b a nT MT 
bmn’ ||sin — z|| ||sin yll = af | f(x, y)sin — zx sin y dady. 
a b o Jo a b 
Since 
2 2 nT 
||sin(n’r/a)z|| = / sin? (=) dr =a 
oe a 
and 


b 
Isin’? = f sin? (= 
—b a 


the coefficients in (3.34) are given by 


4 a pb 
binii = a f f(x, y)sin a sin Ty dady. (3.36) 
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The expression on the right-hand side of (3.35) is called a double Fourier 
series of f and the equality holds in £? over the rectangle (0, a) x (0,6). Under 
appropriate smoothness conditions on the function f, such as the continuity 
of f and its partial derivatives on [0,a] x [0,5], (3.35) becomes a pointwise 
equality in which the Fourier coefficients bmn are defined by (3.36). The series 
representation (3.34) is then convergent and satisfies the heat equation in the 
rectangle (0, a) x (0, b), the conditions on its boundary, and the initial condition 
at t = 0. 


It is worth noting that the presence of the exponential function in (3.30) 
and (3.34) means that the solution of the heat equation decays exponentially 
to 0 as t — ov. This is to be expected from a physical point of view, because 
heat flows from points of higher temperature to points of lower temperature; 
and because the edges of the domain in each case are held at (absolute) zero, 
all heat eventually seeps out. Thus, in the limit as t — oo, the heat equation 
becomes 

Au = 0, 


known as Laplace’s equation. This is a static equation which, in this case, 
describes the steady state distribution of temperature over a domain in the 
absence of heat sources; but, more generally, Laplace’s equation governs many 
other physical fields which can be described by a potential function, such as 
gravitational, electrostatic, and fluid velocity fields. Some boundary-value prob- 
lems for the Laplace equation are given in Exercises 3.34 through 3.37. We shall 
return to this equation in the next chapter. 


3.3.2 The Wave Equation 


If a thin, flexible, and weightless string, which is stretched between two fixed 
points, say x = 0 and x = J, is given a small vertical displacement and then 
released from rest, it will vibrate along its length in such a way that the (ver- 
tical) displacement at point x and time t, denoted by u(x,t), satisfies the wave 
equation in one space dimension, 


Ute = C’Une, here 140, (3.37) 


c being a positive constant determined by the material of the string. This 
equation describes the transverse vibrations of an ideal string, and it differs 
from the heat equation only in the fact that the time derivative is of second, 
instead of first, order; but, as we now show, the solutions are very different. 
The boundary conditions on u are naturally 


u(0,t)=u(l,t)=0, +t>0, (3.38) 


124 3. Fourier Series 


and the initial conditions are 
u(x,0) = f(x), uz(z,0) = 0, 0<a<l, (3.39) 


f(x) being the initial shape of the string and 0 the initial velocity. Here two 
initial conditions are needed because the time derivative in the wave equation 
is of second order. 


Again using separation of variables, with u(x,t) = uv(x)w(t), leads to 


vu" (a) w” (t) 


_ a 98 
v(x) w(t) as 
Hence 
v" (x) + v(x) = 0, O0<a<l, 
w” (t) + eA w(t) = 0, t>0, 


whose solutions are 


v(x) = acos Ax + bsin àz, 
w(t) = a' cos càt + b' sin căt. 


As in the case of the heat equation, the boundary conditions imply a = 0 and 
A =nr/l with n € N. Thus each function in the sequence 


Cn Cnt NT 
Un(x, t) = (ancos Tt + b,sin at) sin Th nEeN, 


satisfies the wave equation and the boundary conditions at x = 0 and x = l, so 
(by superposition) the same is true of the (formal) sum 


-5 (ancos 7 Mri bnsin TE t) sin =e (3.40) 


The first initial condition implies 
z nT 
S ansin =a = f(z), O0<a<il. 
n=1 


Assuming f is piecewise smooth, we again invoke Corollary 3.11 to conclude 
that 


l 
T= F f(x)sin Me dz. (3.41) 
0 


The second initial condition gives 


D i bncos Tt = 0, 0<a<l, 


n=1 
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hence bn = 0 for all n. 
The solution of the wave equation (3.37) under the given boundary and 
initial conditions (3.38) and (3.39) is therefore 
u(x,t) = 2 An COS st sin ae (3.42) 
where a, are determined by the Fourier coefficient formula (3.41). Had the 
string been released with an initial velocity described, for example, by the 
function 
ur(a, 0) = g(x), 
the coefficients bn in (3.40) would then be determined by the initial condition 


5 TE by sin 0 = g(x) 


as i 
bn = ea g(x)sin ne de. 
cnt Jo l 

Note how the vibrations of the string, as described by the series (3.42), 
continue indefinitely as t — oo. That is because the wave equation (3.37) does 
not take into account any energy loss, such as may be due to air resistance 
or internal friction in the string. This is in contrast to the solution (3.30) of 
the heat equation which tends to 0 as t — œ as explained above. In Exercise 
3.32, where the wave equation includes a friction term due to air resistance, 


the situation is different. 


EXERCISES 


3.27 Use separation of variables to solve each of the following boundary- 
value problems for the heat equation. 


(a) Up = Ure, OX a<am, t>0, 
u(0,t) = u(z,t) =0, t > 0, 
u(z,0) = sin? z, 0< £ <r. 
(b) up = kuss, O< a <3, t>0, 
u(0,t) = uz(3,t) = 0, t > 0, 

Sa 


u(x, 0) = sin Fx — sin Fe. 


The condition that uz = 0 at an endpoint of the bar means that the 
endpoint is insulated, so that there is no heat flow through it. 
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3.28 


3.29 


3.30 


3.3 


pare 


3.32 
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If a bar of length l is held at a constant temperature Tọ at one end, 
and at T; at the other, the boundary-value problem becomes 


ut = kuee, 0<2<l,t>0, 
u(0, t) = To, u(l, t) =T, t>0, 
u(x,0) = f(x), O0<a<l. 
Solve this system of equations for u(x,t). Hint: Assume that u(x, t) = 


u(x,t) + y(x), where v satisfies the heat equation and the homoge- 
neous boundary conditions v(0,t) = v(l, t) = 0. 


Solve the following boundary-value problem for the wave equation. 


Utt = Urz, 0<a<_7,/1>0, 
u(0,t) = u(l, t) = 0, t>0, 
u(z,0) = a(l— x), uz(z,0) =0, O<a<l. 


Determine the vibrations of the string discussed in Section 3.3.2 if 
it is given an initial velocity g(a) by solving 
Utt = C Uzr, 0<xz<l,t>0, 
u(0,t) = u(l,t) = 0, t>0, 
u(z,0)= f(x),  w(x,0) = g(x), 0<z<l. 


Show that the solution can be represented by d’Alembert’s formula 


2 7 at+ct 
uzt) = sitet et) e-d] Od, 


2c tct 


where f and g are the odd periodic extensions of f and g, respec- 
tively, from (0,1) to R. 


If gravity is taken into account, the vibrations of a string stretched 
between x = 0 and x = l are governed by the equation ut = 
C Urr — g, where g is the gravitational acceleration constant. De- 
termine these vibrations under the homogeneous boundary condi- 
tions u(0,t) = u(l,t) = 0, and the homogeneous initial conditions 
u(a,0) = uz(x,0) = 0. 


If the air resistance to a vibrating string is proportional to its veloc- 
ity, the resulting damped wave equation is 


2 
Utt + kut = C Ugre 
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for some constant k. Assuming that the initial shape of the string 
is u(z,0) = 1, 0 < x < 10, and that it is released from rest, de- 
termine its subsequent shape for all (x,t) € (0,10) x (0,00) under 
homogeneous boundary conditions. 


3.33 The vibrations of a rectangular membrane which is fixed along its 
boundary are described by the system of equations 
Utt = C (Ura + Uyy), 0<a<a, 0<t<b, t>0, 
u(0,y,t) =ula,y,t)=0, O<y<b t>0, 
u(x,0,t) = u(a,b,t) =0, 0<2<a, t>0, 
Solve this system of equations in the rectangle (0,a) x (0,0) for all 
t> 0. 


3.34 Determine the solution of Laplace’s equation in the rectangle (0, a) x 
(0,6) under the conditions u(0,y) = u(a,y) = 0 on 0 < y < b and 
u(x, 0) = f(x), uy(x,b) = g(x) on 0 < z < a. 


3.35 Solve the boundary-value problem 


Ure + Uyy = 0, 0<zr<l, 0<y<l, 
u(0, y) = uz(1l, y) =0, 0 <y < 1, 


3 
u(x, 0) = 0, u(x, 1) = sin e 


3.36 Solve the boundary-value problem 


Una t Uyy = 0, 0<a<7, 
uz(0, y) = uz(T, y) = 0, O<y<T, 


uy(x,0) = cosx, uy(x, 7) = 0, O<a<n. 
3.37 Laplace’s equation in polar coordinates (r,@) is given by 
1 1 
Urr + Ur + -z u00 = 0. 
r r 


(a) Use separation of variables to show that the solutions of this 
equation in R? are given by 


un(r,0) = ao + do log r, Im 
nO U(r” + dnr—")(ancos nd + basin nð), n €N, 


where an, bn, and dn are integration constants. Hint: Use the 
fact that u(r,0 + 27) = u(r, 0). 
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(b) Form the general solution u(r, 0) = 377°, un(r,@) of Laplace’s 


equation, then show that the solution of the equation inside the 
circle r = R which satisfies u(R, 0) = f(@) is given by 


u(r, 0) = Ao + 1 " (Ancos né + Basin n0), 
i L(a) 


for all r > R, 0 < 6 < 2r, where A,, and Bn are the Fourier 
coefficients of f. 


Determine the solution of Laplace’s equation in polar coordi- 
nates outside the circle r = R under the same boundary condi- 
tion u(R, 0) = f(0). 


4 


Orthogonal Polynomials 


In this chapter we consider three typical examples of singular SL problems 
whose eigenfunctions are real polynomials. Each set of eigenfunctions is gener- 
ated by a particular choice of the coefficients in the eigenvalue equation 


(pu’)’ + ru + Apu = 0, (4.1) 
and of the interval a < x < b. In all cases the equation 
p(u'v — wi’)? = 0, (4.2) 


of course, has to be satisfied by any pair of eigenfunctions u and v. 

If {y, : n € No} is a complete set of eigenfunctions of some singular SL 
problem, then, being orthogonal and complete in Lab), the sequence (¢,,) 
may be used to expand any function f € L?(a, b) by the formula 


aeS Ee (4.3) 


2 fn 
nao lenll 


in much the same way that the trigonometric functions cos ng or sin ng were 
used to represent a function in £2(0,7). Thus we arrive at a generalization of 
the Fourier series which was associated with the choice p(x) = 1, r(x) = 0, 
p(x) = 1, and (a,b) = (0,7) in (4.1). Consequently, the right-hand side of 
Equation (4.3) is referred to as a generalized Fourier series of f, and 


(F, Pn) p 


Cn = — z, neENo, 
lenll 
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are the generalized Fourier coefficients of f. 
A corresponding result to Theorem 3.9 also applies to the generalized 
Fourier series: If f is piecewise smooth on (a,b), and 


1 b 
Cn = —Z L)~, (x) p(x)daz, 
rae, A 


then the series o 
Se) = È enva(a) 


converges at every x € (a,b) to 4[f (xt) + f(x7)]. A general proof of this result 
is beyond the scope of this treatment, but some relevant references to this 
topic may be found in [5]. Of course the periodicity property in R, which was 
peculiar to the trigonometric functions, would not be expected to hold for a 
general orthogonal basis. 


4.1 Legendre Polynomials 


The equation 


(1 — z°)ju” — 2g +u =0, -l<2<1, (4.4) 


is called Legendre’s equation, after the French mathematician A.M. Legendre 
(1752-1833). It is one of the simplest examples of a singular SL problem, the 
singularity being due to the fact that p(x) = 1 — x? vanishes at the endpoints 
x = +1. By rewriting Equation (4.4) in the equivalent form 


2 
i E yee (4.5) 


(a tiz 


u 
we see that the coefficients are analytic in the interval (—1, 1), and the solution 


of the equation can therefore be represented by a power series about the point 
x = 0. Setting 


u(x) = X orat, -l<a«<l, (4.6) 
k=0 


and substituting into Legendre’s equation, we obtain 
(1— 27) 5 k(k—1)e,2*-? — 2 5 kepe” + aS ckz? = 0 
k=1 k=0 


Stk +2)(k + 1)ck42 + (A — k? — k)cx]x* = 0, 
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from which 
k(k+1)-— à 


2 TREE +2) 


Equation (4.7) is a recursion formula for the coefficients of the power series 
(4.6) which expresses the constants cp for all k > 2 in terms of cg and cı, and 
yields two independent power series solutions, one in even powers of x, and the 
other in odd powers. 


(4.7) 


If A = n(n +1), where n € No, the recursion relation (4.7) implies 


0 Cn+2 Cn+4 Cn+6 Pane 


and it then follows that one of the two solutions is a polynomial. In that case 
(4.7) takes the form 


k(k +1) —n(n+1) (k-—n)(k+n+1) 


= = . 4.8 
ETZ (kKt+D(k+2) FDR | a) 
Thus, with cg and cı arbitrary, we have 
n(n+1 n—1)(n+2 
= tN, gg = PUT), 
n(n — 2)(n + 1)(n +3) (n — 3)(n— 1)(n + 2)(n + 4) 
C4 = Al Co, 65 = 5l C1 


and the solution of Legendre’s equation takes the form 


u(x) = co h ED, DEED ED a... 


2! 4! 
n—1)(n+2 n—3)(n—1)(n+2)(n+4 
as [e COE Dae Nana Lab 


= couo(x) + ciu (x), 


where the power series uọo(x) and u(x) converge in (—1,1) and are linearly 
independent, the first being an even function and the second an odd function. 


For each n € No we therefore obtain a pair of linearly independent solutions, 
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n=2, w(x) = 1-32? 
2 1 
ulz) =z- zI? — pa? + ; 
n=3, uo(z) =1-62?+ 324+ 
ulz) =z- 208, 


one of which is a polynomial, and the other an infinite power series which 
converges in (—1, 1). We are mainly interested in the polynomial solution, which 
can be written in the form 


ant” Fanat Hapat tee (4.9) 


This is a polynomial of degree n which is either even or odd, depending on the 

integer n. By defining the coefficient of the highest power in the polynomial 
ve ODE aera (Qn—1) 

n l . re? MELELE n — 

p= = ; 4.10 

s 2"(n!)? n! (ey) 

the resulting expression is called Legendre’s polynomial of degree n, and is 

denoted by P,(«). As a result of this choice, we show in the next section that 

P,(1) = 1 for all n. The other coefficients in (4.9) are determined in accordance 


with the recursion relation (4.8): 


ean. n(n — 1) F 
e= n= A 
n(n—1) (2n)! 
2(2n — 1) 2” (n!)? 
(2n — 2)! 
2”(n — 1)!(n — 2)! 
n= 2)ln= 3) | 
mag A(Qn—3) 77? 
_ (2n — 4)! 
~ 2"2'(n — 2)!(n — 4)! 


(2n — 2k)! 
> 2k 
mkn — k(n- 2 "=o" 


An—-2k = ( 1)* 


where we arrive at the last equality by induction on k. The last coefficient in 
P, is given by 
n! 


a = (1)? 2” (n/2)\n/2)! 
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if n is even, and 
(n+ 1)! 
ane es 
when n is odd. Thus we arrive at the following representation of Legendre’s 
polynomial of degree n, 


a, = (C1092 


1 ee) Qn-2k)! ng 
Palt) = on D( va ina i (Eu 
0 


where [n/2] is the integral part of n/2, namely n/2 if n is even and (n — 1)/2 
if n is odd. The first six Legendre polynomials are (see Figure 4.1) 


P(x) = 1, P(x) = 2, 
P2(x) = $(3x? — 1), P3(x) = $(52° — 32), 
P,(x) = 5(35a* — 302? + 3), P3(x) = (63x5 — 70x + 152). 


Figure 4.1 Legendre’s polynomials. 
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The other solution of Legendre’s equation, known as the Legendre function 
Qn, is an infinite series which converges in the interval (—1,1) and diverges 
outside it (see Exercise 4.3). The Legendre function of degree n = 0 is given by 

1 


1 
Qo(z) = r+ 32° + a? +o 


ati 1+zr 
=3 BlI z)’ 


This becomes unbounded as « tends to +1 from within the interval (—1, 1). The 
same is true of Q(x) (Exercises 4.3 and 4.4). The other Legendre functions 
Qn can also be shown to be singular at the endpoints of the interval (—1, 1). 
The only eigenfunctions of Legendre’s equation which are bounded at +1 are 
therefore the Legendre polynomials P,,. With p(x) = 1 — xz? = 0 at x = +1, the 
condition (4.2) is satisfied. 

Thus the differential operator 


a la A 


in Legendre’s equation is self-adjoint. Its eigenvalues An = n(n + 1) tend to oo, 
and its eigenfunctions in £?(—1,1), namely the Legendre polynomials P,,, are 
orthogonal and complete in £?(—1,1), in accordance with Theorem 2.29. We 
verify the orthogonality of P, directly in the next section. 


EXERCISES 


4.1 Verify that P, satisfies Legendre’s equation when n = 3 and n = 4. 


4.2 Use the Gram-Schmidt method to construct an orthogonal set of 
polynomials out of the independent set 


{1,a,27,0°,0+,25:-1<2¢ <1}. 


Compare the result with the Legendre polynomials P(x), Pi(x), 
P(x), P(x), P4(x), and Ps(x), and show that there is a linear re- 
lation between the two sets of polynomials. 


4.3 Use the ratio test to prove that (—1, 1) is the interval of convergence 
for the power series which defines the Legendre function Q,(x). Show 
that the singularities of Qo(a) at x = +1 are such that the product 
(1—2?)Q6(zx) does not vanish at x = +1, whereas (1—2?)Qo(x) > 0 
as x — +1, hence the boundary condition (4.2) is not satisfied when 
u = Qo, p(x) = 1 — 2”, and v is a smooth function on [—1, 1]. 
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4.4 Prove that 


Pyn41(0) = 0, 
n (2n — 1)(2n — 3)--- (3)() 
(2n)(2n — 2)-+-(4)(2) | 


4.6 Show that the substitution x = cos@, u(cos@) = y(@) transforms 
Legendre’s equation to 
d? d 
sin 02 + cos A +n(n + 1)sind y = 0, 


where 0 < 6 < m. Note the appearance of the weight function sin 0 
in this equation. 


4.2 Properties of the Legendre Polynomials 


For any positive integer n, we can write 


k=0 
[n/2] n! 
= k 2n—2k 
2 OD cers 
k=0 
LÈ Ota e 
kl(n — k)! l l 
k=[n/2]+1 
Because 
n/2+1 if n is even 
[n/2] + ae if n is odd 
implies 
n—2 if n is even 
2n— n/a] +1) = n if n is odd 
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it follows that the powers on x in the second sum on the right-hand side of 
Equation (4.12) are all less than n. Taking the nth derivative of both sides of 
the equation, therefore, yields 


dx” da” 7 k!(n — k)! 
RA ,(2n —2k)---(n+1—2k) n-k 
ao 1 TE z 
[n/2] 
mn 2 i To a 


Comparing this with (4.11), we arrive at the Rodrigues formula for Legendre 


polynomials, 
1 ad” 
P, (2) = ——— 
(2) 2° n! dx” 


which provides a convenient method for generating the first few polynomials: 


(x? — 1)", (4.13) 


P(x) = L; 
ld 
Pila) = 5a (1) =2, 
i ame Pe ee 


It can also be used to derive some identities involving the Legendre polynomials 
and their derivatives, such as 


Pryi(@) — Pr_i(a) = (2n + 1)Pr(2), (4.14) 
(n+ 1)Pr4i(x) + nPn-1(£) = (2n+1)¢P, (xz), neN, (4.15) 


which are left as exercises. 


We now use the Rodrigues formula to prove the orthogonality of the Legen- 
dre polynomials in £?(—1,1). Suppose m and n are any two integers such that 
0<m<_n. Using integration by parts, we have 


1 1 1 d” 
J P, (x£)x” dz = / ge (x? —1)"dz 
= 4 = 


22! 1 dx” 


1 qr-} 5 1 1 dr—1 
ons m 1)” _ m— 1 2.4 1)\"d 
Mn! | EAN TY mf 2 TE 
gi 1 1 m—2 
TM m1 d A 2 n m(m = 2f m—2 d 2 n 
= 1 1)"dzx. 
an) dx”? @ ) La 27n! zi dx” ae yee 
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Noting that all derivatives of (x? — 1)” of order less than n vanish at x = +1, 
and repeating this process of integration by parts, we end up in the mth step 
with 

1 
Granj =0, 


-1 


(—1)™m! qr—m-1 


2n! der —m—1 


1 
J P, (x)x™ dr = 
-1 


Because 0 < n— m — 1 < n. Thus we see that P,, is orthogonal to x” for all 
m < n, which implies that P,P, for all m < n. By symmetry we therefore 
conclude that 

(Pa, Pm) =0 for all m#n. 


To evaluate ||P,,||, we use formula (4.13) once more to write 


\| Pall? = | P@e = sae | VO OPa), (4.16) 


where y(x) = (x? — 1)”, and then integrate by parts: 


1 1 
y) (ay (a)de = — | jey aai 
=i 


—1 


= (-1)"@2n)! | y(x)dx. (4.17) 


cae (x? —1)"dz = T (l—2)"(1+2)"dzr 


1 1 


n 


= a (l—g)*"*(1+a)""de 


n+l =i 


n! i 2n 
“Gare E 


(nl)? (1 + x)?” + j" 


(Qn)! 2m+1 


—1 


= (aloe 
~ (2n)! (2n +1) oe 
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Combining Equations (4.16) through (4.18), we get 


2 
Pr — Bau , 
Pal =Ara eae Ae 


and hence the sequence of polynomials 


1 3 5 2n+1 
ato) TEO [Pala ane 5 Pile) 2: 


is a complete orthonormal system in £?(—1, 1). Note how ||P,,|| , unlike ||sin n2|| 
and ||cos na||, actually depends on n and tends to 0 as n — ov. 


Example 4.1 


Since the function 


fe)={ 1 -l<a2<0 


1, 0<a<l1 


is square integrable on (—1,1), it can be represented in £7(—1,1) by the 
Fourier—Legendre series 


f(z) = > enPa(a) 


in which the Fourier—-Legendre coefficients c, are determined, according to the 
expansion formula (4.3), by 


LP REN PE a 
“jar 2 J, PD: 


The series ` cn P, is also referred to as a Legendre series and c,, its Legendre 
coefficients. Thus 


1 3 7 
F(z) = 5 Pola) + 7P) hat (4.19) 
Observe that, in as much as 
1 7 1 f =1/2; -l<az<0 
fee) = Pola) =H) F=f GSES 


is an odd function, the Legendre series representation of f does not contain 
any even Legendre polynomials except Pp. Furthermore, because P,,(0) = 0 for 
every odd n, the right-hand side of (4.19) at z = 0 equals 
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=Po(0) = 5 = 50") + FOL, 


in agreement with the generalized version of Theorem 3.9 referred to at the 
beginning of this chapter. 


The Legendre polynomials are generated by the function (1 — 2xt + ¢?)~1/? 


in the sense that 


Palat,  |t| <1, |x| <1. 4.20 
E -5 ltl 2 2) 


To prove this identity we note that, for every fixed x in [—1, 1], the left-hand 
side of (4.20) is an analytic function of t in the interval (—1, 1), which we denote 


Co 


fte) =X ae. leet, (4.21) 


n=0 
This function satisfies the differential equation 
of 
Ot 


Substituting the series representation (4.21) into (4.22) leads to the recursion 
relation 


(1 — 2at + t’) 


= (x —2)f. (4.22) 


(n+ Lan41(@) + nan-ı(x) = (2n+1)ran(xz), nENo, 


which is the same as the relation (4.15) between P,+41, Pn-1, and Ph. The first 


two coefficients in the Taylor series expansion of (1 — 2æt + t?)!/? are 


aolx) = 1 = P(x), a(x) =x = P(x), 


therefore it follows that a(x) = P(x) for all n € No. 
Setting x = 1 and x = —1 in (4.20), we obtain 


1 1 
P,(1)t? = = = e 
> (1) 1—2t+t2 l-t 2 
P, (—1)t” = = = —1)"t". 
> (A 1+2+¢ I1+t 2 ) 


Hence 
Pa(1)= 1, Pa(—1) = (—1)” forall n € No. 
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EXERCISES 


4.7 Use the Rodrigues formula to prove the identity (4.14), then conclude 
that 


ff Paltyat= z 1Pasale) — Pra) 


1 
i P,(t)dt =0 foralln EN. 


—1 


4.8 Prove the identity (4.15). 


4.9 Show in detail how Equation (4.22) follows from Equation (4.21), 
and how the two equations imply the recursion relation for ay. 


4.10 Show that if 0 < r < 1, then 


T = 5 Pal (cos 6)r 


Hence deduce that, if x and y are nonzero vectors in R3 such that 
r = ||x|| / |ly|| < 1 and @ is the angle between them, then 


my (cos 6)r 


ly- ar Iyl 


In particular, if ||y|| = 1, then 


-5 P,,(cos 0) ||x||” . 


Thus, if two masses are oe at x and y, then the gravitational 


val — x|| 

potential between them may be represented (up to a multiplicative 
constant) by a power series in ||x|| with coefficients P,,(cos 0). 

4.11 Expand each of the following functions in Legendre polynomials. 
(a) f(z) =1-—23, -l<2<1. 
b) f(z) =|al, -1<@ <1. 
Compute the first six terms (n = 0 up to n = 5) in (b). 

4.12 Use Equation (4.15) to prove that 
n || Pall? = (2n-1) (@Pn—1, Pa), || Pr—all? = (2n+1) (£ Pr, Pr—1) 5 


and hence conclude that ||P,||? = 2/(2n + 1). 


4.3 Hermite and Laguerre Polynomials 141 


4.13 Expand the function 


f= { 7” -l<2<0 


1, 0<xr<1 


in a Legendre series. Use the formulas in Exercise 4.7 to evaluate the 
coefficients. Compute the value of the series at x = 0 and compare 
it with the average value of f(0*) and f(07). 


4.14 Give two Legendre series expansions of the function f(x) = x on 
(0, 1], one in even degree polynomials, and the other in odd degree. 
Is the expansion valid at x = 0? Repeat the procedure if the function 
is f(x) =1 on [0,1]. 


4.3 Hermite and Laguerre Polynomials 
4.3.1 Hermite Polynomials 


Legendre’s polynomials were defined as a set of solutions of a certain differential 
equation (Legendre’s equation). The same procedure may be followed to define 
the Hermite polynomials. But, instead, we start by defining these polynomials, 
then deduce the equation (Hermite’s equation) that they satisfy, although this 
is not the natural order in which the theory developed; for both equations arise 
naturally in mathematical physics, as we show later in this chapter. 


For each n € No, the function Hn : R — R is defined by 


2 d” 2 
H,,(x) = (—1)"e* —e™. 4.2 
(2) = (-1)"e" Te (4.23) 
This yields a sequence of polynomials 
H(z) E 1, A(x) T 22, 
Ho(x) = 4z? — 2, H3(x) = 823 — 122, 
3 


H4(x) = 16x4 — 482? + 12, H5(x) = 32x° — 160x? + 120x, 


known as Hermite polynomials, named after the French mathematician Charles 
Hermite (1822-1901). Using the formula (4.23), we arrive at the following prop- 
erties for the set {Hn : n € No}. 
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1. H, is a polynomial of degree n. 


Proof 
oe =—2re"* 
ae = (—22)?e"* 2e-*" 
fe = (—22)%e 2? L 12re™? 


By induction, we obtain 


dr 
dx” 


where p is a polynomial of degree less than n. Therefore 


e7% = (—2r) e% + p(a)e™, 


Hn (2) = (—1)"e* [(-22)" + p(e)je 
= (2x)” + (—1)” p(x). (4.24) 


2. The set {H, : n € No} is orthogonal in £?_,.(R). 


Proof 


Assuming m < n, 
Cane i Hy (2) Hy(a)e~® dx 
=" f Hala) 2ed 
= ins mla) Fre z. 


Integrating by parts n times, and noting that, for any polynomial p, 
p(x)e 


2 . 
T7 — 0 as |z| > œ, we obtain 


(Hrn, Hn) e—a = (—1)™ 2 | : 


Consequently, Hm is orthogonal to H, for all m < n and hence, by the 
symmetry of the inner product, for all m Æ n. 
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3. || All? = Pn! 7. 


Proof 


l 
g 8 
| en | 
Q 
SS 
= 
8 
aes 
fav) 
| 
8 
aQ 
8 


where the last equality follows from Equation (4.24). But 


i e7? dx = Vr 


—co 


(see Exercise 4.15), hence we arrive at the desired equality. 


4. For every x € R, 


= ly 
e?z i? eA. (4 25) 
n= snl 
In other words, e2%-* is a generating function for the Hermite 
polynomials. 
Proof 


For every x € R the function f(t, £) = e2"'-© is analytic in t over R, and 


is therefore represented by the power series 


2 18” 
CO: es 
o ` t= 


n= 


n 


Because 
o” f Z o” x?—(x—t) 
Ol egy, Ole eb 
= 2? O 0t)? 
ot” E 
2 d” 2 
= (-1)"e” =e * 
dy TE 
2 d” 2 
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where y = x — t, it follows that 
ee ee 
es dx” 


Theorem 4.2 
Hn, satisfies the second-order differential equation 

u” — 2ru' + 2nu=0, «eR. (4.26) 
Proof 
Differentiating the identity (4.25) with respect to x, we obtain 

at—t? < 1 n 
WPR = ye En (at, (4.27) 
n=1 


where the left-hand side can also be written as 


se 1 
Iperi? =2 — Hn (x)t" t 
n: 

n=0 


n+1 
=2X ——— FY sia 
7 mie 


-255 — Hy_1(x)t”. (4.28) 


By comparing Equations (4.27) and (4.28) we arrive at the relation 
H! (x) = 2nHn—1 (£), neN. (4.29) 
Differentiating (4.25) with ee to t gives 


2(x — e2zt— t? r) 1 -5 -Hn (e : 
merit 
from which 
aD Ame = 25° Halo + So Haat) 
n=0 nao. 
“nti 
D mt GED u n +1(2 
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Comparing the two sides of this equation, we see that 


2xHo(x) = Hı (x) 
2¢H, (x) = 2nHn-1 (£) + Hn (£), neN. (4.30) 


Combining Equations (4.29) and (4.30) gives 
2xHn (£) = H| (£) + Hn41 (£), 


and differentiating this last equation, we end up with 


Ay (@) = 22H; (a) + 2Hp (a) — Hp 41 (2) 
= 2H) (x) + 2H,,(x) — (2n + 2)Hn (£) 
= 22H) (x) — 2n Hn (£). 


Equation (4.26) is called Hermite’s equation, and we have shown that one 
solution of this equation is the Hermite polynomial H,,. As with Legendre’s 
equation, the other solution turns out to be an analytic function which can be 


represented by a power series in x (see Exercise (4.21)). Multiplication by e~®” 
converts Hermite’s equation to the standard Sturm—Liouville form 
(euw) +2ne" u=0, ER, (4.31) 


in which p(x) = e72, A= 2n, p(x) = e7?’ , and the differential operator 


d z2 d 
dx ; dx 


is formally self-adjoint. Because H, (x£), but not the infinite series solution (see 
Exercise 4.22), lies in L? (R), we therefore conclude that the sequence of 
Hermite polynomials (H, : n € No) forms a complete orthogonal system in 
L?_2(R). 

So 


4.3.2 Laguerre Polynomials 


The differential equation 
zu” + (1— sju +nu=0, 0<zr<%, nE€No, (4.32) 


is known as Laguerre’s equation, after the French mathematician Edmond 
Laguerre (1834-1886). Its standard form 


(ze "u y) + ne *u = 0 (4.33) 
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is obtained by multiplication by e`”, the weight function in this case. Note 
that p(x) = xe~* vanishes at x = 0, so no boundary condition is needed at 
this point. The nonsingular solutions of Laguerre’s equation are given, up to 
an arbitrary multiplicative constant, by the Laguerre polynomials 


Lolz) = 1, 
Lı(z)=1- z, 


1 
La(x) = 1 — 2x + z7 


e 
L ae 


where L,, has degree n. Because L, are eigenfunctions of the SL operator in 
Equation (4.33), we would expect to have (Lm, Ln)e-s = 0 for all m Æ n, and 
this follows immediately from the observation that, for all m < n, 


(grer =); (4.34) 


co Qa 


(a"e~*)dx = 0. 


Py ee 


Furthermore, 
(2", Ln) = = n" f xe “dx = (-1)"nl, 
0 


and because the coefficient of x” in Ln is (—1)"/n!, it follows that 


Lal? = (Ln, Ln) = Al (x”, Ln) =1 
for all n, and the Laguerre polynomials are in fact orthonormal (and complete) 
in £2, (0,00). 
EXERCISES 
4.15 Show that f% e7? dx = /T by using the equality 


(fort) = [feat 


Hint: Evaluate the double integral by transforming to polar coordi- 
nates. 
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4.16 Show that the powers of x in the polynomial H,,(x) are even if n is 
even, and odd if n is odd. 


4.17 Prove that, for all n € No, 


Fz, (0) = ae) era Hən+1(0) = 
4.18 Use induction to prove 
[n/2] s 
27)” 2k 
SaS (yee 
H,,(z) pa 1) Hla 


4.19 Expand the function f(x) = 2”, m € No, in Hermite polynomials. 
4.20 Expand the function 


f@={ FS) 


; xr>0 
in terms of Hermite polynomials. Compute the first five terms. 
4.21 The differential equation 
u” — 2xu' + 2du = 0, 


in which is not necessarily an integer, is a generalized version of 
Equation (4.26), and is also called Hermite’s equation. 


(a) Assuming that u(x) = 09 c.2**" and substituting into the 
equation, show that 


2(k +r —X) 
k+r+2)(k+r+1 


Ck+2 = ( Ta k € No. 


(b) If r = 0, the solution is 


keisi 2(A = 1) 3 J 22(A—1)(à— 3) 5 


3! 5! 


(c) Conclude that the general solution of Hermite’s equation for any 
real A is u(x) = couo(x)+c1uı (x), where co and c; are constants. 
What happens when A is a nonnegative integer? 
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4.22 With up and u; as in Exercise 4.21, prove that each of the functions 
e ?’ u(x) tends to a constant as |x| > oo, and that 
this constant is 0 if, and only if, the solution is H,,. Conclude from 
this that the only solutions of Hermite’s equation in Le a (R) are 
the Hermite polynomials. 


>g x 


*ug(x) and e~ 


4.23 Prove that 


where 


are the binomial coefficients. 


3 


4.24 Express x? — x as a linear combination of Laguerre polynomials. 


4.25 Expand f(x) = x™, where m is a positive integer, in terms of 
Laguerre polynomials. 


4.26 Expand f(a) = e*/? in a Laguerre series on [0, 00). 
4.27 Verify that L, satisfies Equation (4.32). 


4.28 Determine the general solution of Laguerre’s equation when n = 0. 


4.4 Physical Applications 


The orthogonal polynomials discussed in this chapter have historically been 
intimately tied up with the study of potential fields and, more recently, quan- 
tum mechanics. Here we look at two typical examples where the Legendre and 
Hermite polynomials crop up. The first is in the description of an electrostatic 
field generated by a spherical capacitor, as a result of solving Laplace’s equa- 
tion in spherical coordinates. The second is in representing the wave function 
for a harmonic oscillator. 


4.4.1 Laplace’s Equation 


In R? the second-order partial differential equation 


u u u 
u aa a (4.35) 
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oe | 


Figure 4.2 Spherical coordinates. 


is called Laplace’s equation, after the French mathematician Pierre de Laplace 
(1749-1827). It provides a mathematical model for a number of significant phys- 
ical phenomena, such as the static distribution of an electric, gravitational, or 
temperature field in free space, as pointed out in Section 3.3. In this context the 
scalar function u = u(x, y, z) represents the electric potential, the gravitational 
potential, or the temperature at the point (x,y,z). Under the transformation 
from Cartesian coordinates (x,y,z) to spherical coordinates (r, 0, p), defined 
by (see Figure 4.2) 


x = r cos 0 sin 


y = rsin f sin y 


z = r cos p, 


where r > 0, =n < 89 < m, 0 < p < T, Equation (4.35) takes the form 


ð / ðu 1 Ou 1 oO Ou 
iny~— }=0. 4.36 
Or (« =) sin? p 00? siny Op (sin e2) ee) 
Assuming that the function u is symmetric with respect to the z-axis, it will 
be independent of 0 and Equation (4.36) becomes 


Ə / ðu 1 Of, Ou 
| -— = K 4. 
Or ¢ Or ) sin y Op (sin % Z) j (ea 
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We use the method of separation of variables to solve Equation (4.37), so 
we assume that u(r, p) is a product of a function of r and a function of y, 


u(r, p) = o(r)w(y), 


then substitute into (4.37). After division by vw this gives 


1d / adv 1 d dw 
= i 4. 
vdr (« z) wsin y dp (sne) i (35) 


where the left-hand side depends only on r, and the right-hand side depends 
only on y. Assuming Equation (4.38) holds over a domain 2 (open, connected 
set) of the ry-plane, each side must be a constant, which we denote A. Thus 


we obtain the pair of equations 


r?u" + 2rv' — Au = 0, (4.39) 


— (sin pw’)! + Aw = 0. (4.40) 
siny 


Setting € = cosy in Equation (4.40), and noting that 
d č dd i d 
dp dode dé’ 
1 d dw d a dw 
Tp dp (sme) dé [ OF 
leads to Legendre’s equation 


d 2, dw] | = 
dé fja aZ] + Aw = 0. 


If X= n(n+1), n € No, then the solutions of Equation (4.40) are given by the 
Legendre polynomials 


Wn(~) = Pn(€) = Pn(cos p). 


Equation (4.39), on the other hand, is of the Cauchy—Euler type and the 
substitution u(r) = r“ yields 


lala — 1) + 2a — n(n + 1)]r“ = 0, 


which implies a = n or a = —n — 1, hence 
Un(r) = anr” + br 


for some arbitrary constants an and bn. Thus the sequence 


Un(T, 2) = vn (r)wn (p) = (anr™ + bar” 1) P,, (cos o) 
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satisfies Laplace’s equation (4.37) for each n € No, and the complete solution 
of the equation is therefore formally given by 


u(r, p) = So u(r, o) 


= X (anr” + bar” +) P, (cos g). (4.41) 
n=0 


Two observations are worth noting in connection with the representation 
(4.41) of the solution to Laplace’s equation. The first is that the Legendre func- 
tions Q,,(cos p) were not taken into account, because these functions become 
unbounded along the z-axis (where cos y = £1). For the same reason the terms 
bnr”! are dropped (by setting b,, = 0) if the domain 2 includes the origin 
r = 0. The second observation, already mentioned in Remark 3.16, is that the 
generality lost in the assumption that u(r, p) is a product of a function of r 
and a function of ọ is restored by taking a linear combination of these solutions 
as we do in (4.41), P,, being complete in £?(—1,1). Of course the constants an 
and bn are determined by additional (boundary) conditions. 


Suppose, for example, that u(r,y) satisfies Laplace’s equation inside the 
ball 0 < r < Rand that u(R, p) = f(y) on the spherical surface r = R, where 
f is a given function in £?(0, 7). Using (4.41) with bn = 0 for all n, 


u(r, p) = 5 anr” Pa (cos p). (4.42) 


n=0 


Applying the boundary condition at r = R, we obtain 


u(R,) = Y` an R” P, (cosy) = f(y), 


n=0 


from which we conclude that a,R” are the Fourier-Legendre coefficients of 
f(y) when expanded in terms of P, (cos y). Thus 


Gy, R” = (fs Pa) 
I|_ Pr ll 
2n+1 ft 
= TEL S AEPD, 
and therefore 
2n +1 f" 
an = Tf f(v)Pn(cosy)sny dy, neENo. (4.43) 
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Figure 4.3 Spherical capacitor. 


Outside the ball, in the region r > R, the nonnegative powers of r become 
unbounded and their coefficients must therefore vanish. Hence the solution in 
this case is given by 


u(r, p) = 5 bar”! Pa (cos g), 


n=0 
where 7 i z 
bn = ae | f(v)Pn(cos y)sin y dọ. 
0 


Example 4.3 


If opposite electric charges are placed on two hemispherical conducting sheets, 
which are insulated from each other, an electric field is generated between them, 
both inside and outside the spherical surface. The apparatus is called an electric 
capacitor (see Figure 4.3). Suppose the radius of the spherical capacitor is 1. If 
the upper hemisphere has potential 1, and the lower hemisphere has potential 
—1, determine the potential function inside the sphere. 


Solution 


Let u denote the potential function of the electric field at any point in space 
outside the conducting surfaces. The electric potential (and hence the elec- 
tric charge) on each hemisphere is distributed symmetrically about the z-axis, 
therefore u depends only on r and y. Inside the sphere it has the form (4.42). 
On the spherical surface, 
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1 0<y<7/2 
1 = ? a 
whe) { —1, r/2<p<r. 


Applying the formula (4.43), with R = 1 and f(y) = u(1, 9), 


In +1 m/2 a 
ie | Eu cOs ynia ip +f (—1)Pn(cos y)sin p | 
9 T/2 


2+1 f! 
= I [Pa(é) — Pa(—€)]a€. 


Therefore an = 0 for even values of n, and 


_ n+l 


Aan = 


f 2P@ag= (n41) f Pag 
0 0 


when n is odd. Thus 


Based on the result of Exercise 4.7, a general formula for an, when n is odd, is 
given by 


iy = (20 +1) | P, (£)dé 
= rer (1) — re (0). 


The potential inside the capacitor is therefore 


3 7 11 
u(r, ~) = 57 Pi (cos y) - g” Ps(cos y) + ig” (cos yp) +: 


4.4.2 Harmonic Oscillator 


In quantum mechanics the state of a particle which is constrained to move on a 
straight line (the z-axis) is described by a wave function W(z,t). If the motion 
is harmonic, the dependence on time t is given by 


P(x, t) = pla)e 7", 
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where E is the total energy of the particle and A is a universal constant known 
as Plank’s constant. The location of the particle is then determined by the 
function y(x) in the sense that |:b(x)|? /||l|? is a measure of its probability 
density in R. If the particle is located in a potential field V(x), the function 7 
satisfies the time-independent Schrédinger equation 

m 
Fa 


Y” + A [E — V (x)]y = 0. 


By defining A = 2mE/h? and r(x) = —2mV(x)/h?, Schrödinger’s equation 
takes the form of a standard Sturm—Liouville eigenvalue equation 


w+ [A+ r(x)]y = 0. (4.44) 


On the interval —co < x < œ, the condition that p — 0 as x — too defines a 
singular SL problem. 

When V(x) is proportional to x”, the resulting system is called a harmonic 
oscillator. For the sake of simplicity we take V(x) = h?x?/2m, so that Equation 
(4.44) becomes 


wo +(A—27)p = 0. (4.45) 
If we set u(x) = e? /2y(zx), it is a simple matter to show that 
u” — 2xu' + (A-1)u=0, 


which is Hermite’s equation when A = 2n + 1. Its solution is therefore Hn. 
Consequently, the eigenvalues of Equation (4.45) are 


An =2n+1, nENo, 


which determine the (admissible) energy levels E, = (2n + 1)h?/2m of the 
particle in the harmonic oscillator, and the corresponding eigenfunctions are 
given by 

Pala) =e"* Hala), nENo, 


which represent the wave functions of the particle in the admissible energy 
levels. The functions e~*”/?H,,(x), which belong to £2(R), are called Hermite 
functions. The probability that the particle is located in the interval (a,b) is 
given by 


1 b b p2 -2d 
— — f 2(n)e-® dz = Ja nfte = . 
|Enll-22 Ja HA ae da 


on 
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EXERCISES 


4.29 In Example 4.3, determine the surface where u(r, p) = 0. 


4.30 In Example 4.3, determine the potential function outside the sphere 
row, 


4.31 Find the solution u(r, y) of Laplace’s equation inside a sphere of 


radius R if 
_ f 10, 0<y<T/2 
WR) = { 0, T/L PLIT. 


4.32 Suppose u satisfies Laplace’s equation inside the hemisphere 0 < r < 
1,0 < ọ < 7/2. If u(r,n/2)=0on0 <r < 1 and u(1,y) = 1 on 
0 < yp < T/2, show that 


u(r, p) = X (-1)" & =) Speen Pans (00s p) 


n=0 
inside the hemisphere. 
4.33 Suppose u satisfies Laplace’s equation inside the hemisphere 0 < r < 
R, 0< p < 7/2. If u(r, n/2)=00n0 <r < R, and u(R, p) = fẹ) 
on 0 < y < 7/2, show that 


Co 


u(r, p) = Xo Con (z)" P2n,(cos 9), 


n=0 


n/2 
TE ee i Flo) Pamlor y)sin yp dp. 
0 
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Bessel Functions 


We start by presenting the gamma function and some of its properties. This 
function is used to define the Bessel functions, hence its relevance to the subject 
of this chapter. 


5.1 The Gamma Function 
The gamma function is defined for all x > 0 by the improper integral 
T(x) = Í e`” ldt. (5.1) 
0 


This integral converges for all positive values of x, therefore it clearly represents 
a continuous function on (0,00). In fact we can show that I" is of class C% on 
(0,00) (see Exercise 5.1). Integrating by parts, we have 


Co Co 
r(z+1)= J e~t dt = -ettr + al ms aa 
0 0 


Co 
=x f ee ldt, 
0 


which gives the characteristic recursion relation of the gamma function 


r(x+1)= r(x), x > 0. (5.2) 
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If x = n is a positive integer, then 


TI(n+1) =nI(n) 
=n(n—1)P(n—-1) 


Sani 


With 


By Se ital, 


we have 
I'(n+1)=n! 
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which means that the gamma function I" is an extension of the factorial map- 


ping n > (n — 1)! from N to (0, 00). 
Equation (5.2) implies 
rT 1 
igea 


ax 


where the right-hand side can be extended to (—1, 0) U (0,00). Since 


lim P(e +1) =F) =1, 


the gamma function has a simple pole at x = 0. Using the relation (5.2) re- 


peatedly, we see that 


r(x) = 


I(a+n) 
a(x+1)---(e@+n—1)’ 


This allows us to extend T from (0, 00) to R, except for the integers 0,—1,—2,.. 


where I has simple poles (see Figure 5.1). Because 


neN. 


I(x) >| eres] e ‘dt=e' 
af 1 


for all x > 1, and T(x +1) = «I (x), we also conclude that I(x) tends to co 


as T —> O0. 


k 
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ya 


Figure 5.1 The gamma function. 


EXERCISES 


5.1 Prove that the gamma function, as defined by (5.1), belongs to 
C™(0, 00). 


5.2 Prove that (1/2) = yr. 


5.3 Prove that 
(2n)! yT 


nlan? n € No. 


I(n+1/2)= 
5.4 The beta function is defined by 
1 
PE | Pupi ssis 
0 


(a) Use the transformation u = t/(1 — t) to obtain 


oo u?! 
pan = f Guyer 


(b) Prove that, for any s > 0, 


I(z)= m e*t ldt. 
0 
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(c) With s = u + 1 and z = z + y, show that 


1 1 oo 
= —(u+l)tgety—l gy 
(u+ let ou. g 


then use (a) to obtain the relation 


T(x) (y) 


5.5 Prove that 
Peres) 


92T 
r (2x) 


= Dir. 


5.6 The error function on R is defined by the integral 


2 T 2 
erf(x) = a e™% dt. 


Sketch the graph of erf(x), and prove the following properties of the 
function: 


(a) erf(—x) = — erf (x). 
(b) limg++oo erf(x) = +1. 


(c) erf(a) is analytic in R. 


5.2 Bessel Functions of the First Kind 


The differential equation 


xy" + xy! + (£? — v*)y = 0, (5.3) 
where v is a nonnegative parameter, comes up in some situations where separa- 
tion of variables is used to solve partial differential equations, as we show later 
in this chapter. It is called Bessel’s equation, and we show that it is another 
example of an SL eigenvalue equation which generates certain special functions, 
called Bessel functions, in much the same way that the orthogonal polynomials 
of Chapter 4 were obtained. The main difference is that Bessel functions are 
not polynomials, and their orthogonality property is somewhat different. 


Equation (5.3) has a singular point at x = 0, so we cannot expand the 
solution in a power series about that point. Instead, we use a method due to 
Georg Frobenius (1849-1917) to construct a solution in terms of real powers 
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(not necessarily integers) of x. The method is based on the premise that every 
equation of the form 


y+ Wy + re, =0, 
x x 
where the functions q and r are analytic at x = 0, has a solution of the form 
oo 
y(x) = a" yy cyt” = a*(cg t+ ca + con? +-+), (5.4) 
k=0 


in which t is a real (or complex) number and the constant co is nonzero [12]. 
Clearly t can always be chosen so that cg Æ 0. The expression (5.4) becomes a 
power series when t is a nonnegative integer. 

Substituting the expression (5.4) into Equation (5.3), we obtain 


5 (k+t)(k+t— 1)ceps" tt + XO(k H tera" t 
k=0 k=0 


lo) co 
Ca 5 e = vey aE 2 
k=0 k=0 


or 


co oO CoO 
> (k + t)?cpr"tt + > cpat ttt — p? > cpe tt = 
k=0 k=0 k=0 
Collecting the coefficients of the powers xt, a't!, x'+?,..., xtti, we obtain the 


following equations 


tco — v*cp = 0 
(+1)? -rca =0 
(t + 2)c2 — veg + co = 0 


(t+ 9)°c; — ve; + cj- = 0. (5.7) 


From Equation (5.5) we conclude that t = +v. 
Assuming, to begin with, that t = v, Equation (5.6) becomes 
(+1? — ve, = Qv+ 1)e, = 0. 
Since 2v +1 > 1, this implies cı = 0. Now Equation (5.7) yields 
[w +j)? — v7 Je + cj-2 = GG + 2v)c; + cj-2 = 0, 
and therefore 


1 
G+) 
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Because cı = 0, it follows that c; = 0 for all odd values of j, and we can assume 
that 7 = 2m, where m is a positive integer. The recursion relation (5.8) now 
takes the form 


1 1 
Witt tae ae Mm — SS AROS oe m— 4s N, 
É 2m(2m + 2v) SRE 22m(v + m) ai mS 
which allows us to express C2, C4,Cg,-.-. in terms of the arbitrary constant co: 
1 
C= -= 
=~ Ry +1) 
1 1 
— = Ei 
“=u AA o2) 
1 
= BSI F No 3) 
—1)™ 
C2m = ( ) Co. (5.9) 


22mm\ (vy + 1)(v + 2)---(v+m) 


The resulting solution of Bessel’s equation is therefore the formal series 


x ye ete ae (5.10) 
m=0 
By choosing 
1 
= — 5.11 
= TP +1) Oy) 


the coefficients (5.9) are given by 


(=F 


~ Qvt2mmI (vy +m-+1) 


C2m 


The resulting series 


v = (1) 2m 
ý 2 wmm (vtm 1) 


is called Bessel’s function of the first kind of order v, and is denoted J, (x). 


Thus 
Gog O re 1) G s (912) 


and it is a simple matter to verify that the power series 


2° (ye 


2 2mm (m+v +1) 


2m 
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converges on R by the ratio test. With v > 0 the power x” is well defined when 
x is positive, hence the Bessel function J, (a) is well defined by (5.12) on (0, 00). 


Since 
f ft, v=0 
pan, a= { 0, v >0, 


the function J, may be extended as a continuous function to [0,0co) by the 
definition J,(0) = lim,_,9+ Jy (x) for all v > 0. 


Now if we set t = —v < 0 in (5.4), that is, if we change the sign of v in 
(5.12), then 
PV aU a (-1)™ gym 
J (£)= (5) ( ) 0, 5.13 
@=(5 peers 2 fe One 


remains a solution of Bessel’s equation, because the equation is invariant under 
such a change of sign. But it will not necessarily be bounded at x = 0, as we 
show in the next theorem. 


Theorem 5.1 


The Bessel functions J, and J_, are linearly independent if, and only if, v is 
not an integer. 


Proof 


If v = n € No, then 


Je) Ga 3 me +1) ee i 


But because 1/I (m — n + 1) = 0 for all m — n +1 < 0, the terms in which 
m = 0,1,...,n — 1 all vanish, and we end up with 


Jon) = ee 2 mL F1) G” 


ron = —])\m+n N2M+2N 
7 (H De Eanes (o ` 


m=0 


Co 


=A Gr a m 1) ee 


m=0 


= (-1)"Jn(2). 
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Now suppose v > 0, v ¢ No, and let 


aJ,(x) + bJ_,(x) = 0. (5.14) 


Taking the limit as t — 0 in this equation, we have lim,_,9+ J, (a) = 0 whereas 
lim,_.9+ |J_,| = oo because the first term in the series (5.13), which is 


mayl) 


dominates all the other terms and tends to too. Thus the equality (5.14) cannot 
hold on (0,00) unless b = 0, in which case a = 0 as well. 


Based on this theorem we therefore conclude that, when v is not an integer, 
the general solution of Bessel’s equation on (0, 00) is given by y(x) = cı JL (£) + 
coJ_,(x). The general solution when v is an integer will have to await the 
definition of Bessel’s function of the second kind in Section 5.3. 


In the following example we prove the first of several identities involving 
the Bessel functions. 


Example 5.2 


d 
ge rel = -x Jp41 (£), x>0d,v>0. (5.15) 


Proof 


x” J,(x) is a power series, so it can be differentiated term by term: 


d —v SN d < (-1)” 2m 
a Aue) = dx > 22m+Ym IP (m +v + n? 


m=0 


= 5 (-1)"2m 2m—1 
= x 
fa mtem (m+v +1) 


= -g ” S Eps 2m+v+1 


2m+vtim! I (m +v + D7 


m=0 


== =r” Jp41. 
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Bessel’s functions of integral order, given by 


9. (-1)™ g\ 2m+n 
In(x) = 2 i (5) , o nE€No, (5.16) 


are analytic in (0,00), and have analytic extensions to R as even or odd func- 
tions, depending on whether n is even or odd. We have already established, 
in Example 2.13, that Jo has an infinite set of isolated zeros in (0,00) which 
accumulate at oo. We can arrange these in an increasing sequence 


£01 < S02 < 03 < 


such that éo — œ as k — oo. Using mathematical induction, we can show 
that the same is also true of J, for any positive integer n: Suppose that the 
zeros of Jm, for any positive integer m, is an increasing sequence (Emp : k € N) 
n (0,00) which tends to oo. The function z~ J,,(x) vanishes at any pair of 
consecutive zeros, say Emp and €,,, ,41, therefore it follows from Rolle’s theorem 
that there is at least one point between Emp and m, ppı where the derivative 
of x` ™ J,,(x) vanishes. In view of the identity (5.15), 
d 


Jm+1(£) = lr [17 Im()], 
£ 


hence Jm4+i(x) has at least one zero between mg and Em +1: Thus we have 
proved the following. 


Theorem 5.3 


For any n € No, the equation J,(#) = 0 has an infinite number of positive 
roots, which form an increasing sequence 


Eni < En2 < Ens < 


such that é „ą — œ as k > œo. 


The first two Bessel functions of integral order are 


Haes o" 


r2 rt xê 
= iT + art 
22(1 = Faa 21)2  26(31)2 
cs £N 2m+1 
HS ai we l G 
F x x x 
~ 2 231121 252131 27314! ` 
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=y 


Figure 5.2 Bessel functions Jo and Jı. 


The similarities between these two expansions on the one hand, and those of the 
cosine and sine functions, are quite striking. The graphs of Jo(x) and Jı(x) in 
Figure 5.2 also exhibit many of the properties of cos x and sin x, respectively, 
such as the behaviour near x = 0 and the interlacing of their zeros. When 
we set v = 0 in the identity (5.15) we obtain the relation Jj(x) = -Jı (x), 
which corresponds to the familiar relation (cos)'x = — sin x. But, unlike the 
situation with trigonometric functions, the distribution of the zeros of J, is 
not uniform in general (see, however, Exercise 5.8) and the amplitude of the 
function decreases with increasing x (Exercise 5.32). 

In the next example we prove another important relation between Jo and 
Jı, one which we have occasion to resort to later in this chapter. 


Example 5.4 


| tJo(t)dt = Jı (x) for all z > 0. 
0 


Proof 


Í 
Ms 

= 

Y 

3 

+ 

N 

$ 
5 


z£ ) 2m+1 


> = xtJı(x). 


II 
8 
Ms 
z 
alL 
E 
-A 
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EXERCISES 


5.7 


5.8 


5.9 


5.10 


5.11 


5.12 


5.13 


5.14 


5.15 


Verify that the power series which represents «~” J, (a) converges on 
R for every v > 0. 


Prove that 


Re 2 
Ji j2(x) = y zg sine, J_1j2(z) = \ a, Cos, 


and sketch these two functions. 


Prove that xJ; (x) = vJ (x) — «Jy4i(x), and hence the identity of 
Example 5.2. 


Use Exercises 5.8 and 5.9 to prove that 


3 /si 
J3/9(x) =4/ — (= — cose) : 


TL x 


Prove the identity |x” J (x)| = x” Jp_1(x), and hence conclude that 


2 /cosz 
J-sp(x) = -4 = ( 


+ sin) i 
TX 


Use the identities of Example 5.2 and Exercise 5.11 to establish 


Fifa) = F151 (2) — Jana). 


Prove that 9 
v 
Jvai(“) + Jp-1(x) = = ele). 
Derive the following relations: 


(a) ihe tJ (t)dt = 2xJ;(x) — x? Jo(z). 


(b) fi Js(t)dt = 1 — F(x) — 231 (x)/z. 
Use the identities Jj(a) = —Ji(a) and [xJ,(x)|’ = aJo(x) to prove 
that 


f t” Jo(t)dt = x" J, (x)+(n—1)2” 1 Ja(x)—(n—1)? i t? Jo(t) dt. 
0 0 
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5.16 Verify that the Wronskian W(x) = W (Jp, J_v), where v ¢ No, sat- 
isfies the equation rW’ + W = 0, and thereby prove that 
2 


WO = Sarde 


Using the result of Exercise 5.4(c), and evaluating the integral ex- 
pression for 3(v,1—v) by contour integration, it can be shown that 
—2 sin VT 

TT 


(also see [8]). 


5.3 Bessel Functions of the Second Kind 


In view of Theorem 5.1, it is natural to ask what the general solution of Bessel’s 
equation looks like when v is an integer n. There are several ways we can 
define a second solution to Bessel’s equation which is independent of J, (see, 
for example, Exercise 5.17). The more common approach is to define Bessel’s 
function of the second kind of order v by 


1 
= svr — J-,(x)], 0,1,2,... 
T a zl (x)cos vr (x)] v +£ 
limyonY); n= 0,1 ,2)4.00-% 


In this connection, observe the following points: 

(i) For noninteger values of v, Y, is a linear combination of J, and J_,. Because 
J_, is linearly independent of Jp, so is Y,. 

(ii) When v = n, the above definition gives the indeterminate form 0/0. By 
L’H6pital’s rule, using the differentiability properties of power series, 


no = GE -erae 
= 2 (loe§ +1) n- 5 (5)" Do e (p 
a) a a Se 
where 


1 1 1 
ho = hm = 1 — — e... E 
o=0, ree 
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and 
y= lim (hm — log m) = 0.577215--- 


is called Euler’s constant. Note that the last sum in the expression for Yp 
vanishes when n = 0, and that the presence of the term log xJ;,(x) implies Y, 
is linearly independent of Jn. That the passage to the limit as v — n preserves 
Yn as a solution of Bessel’s equation is due to the continuity of Bessel’s equation 
and Y, with respect to v. For more details on the computations which lead to 
the representation of Y, given above, the reader is referred to [17], the classical 
reference on Bessel functions. 


The asymptotic behaviour of Yp(x) as x — 0 is given by 


2 
= log 5. n=0 
T 
Yn ~ 5.17 
(x) (n—1)! Cm JEN ( ) 
T 2 +i ti 


where f(x) ~ g(x) as x > c means f(x)/g(x) — 1 as x > c. Thus Y,,(2) is 
unbounded in the neighborhood of x = 0. As x —> 0, 

21 41 

» V(x) ~ 


TE T2 


2 
l 
l 
| 


2 
Yo(x) ~ z log a, Yi (a 


In view of Theorem 5.3 and the Sturm separation theorem, we now conclude 
that Y, has an infinite sequence of zeros in (0,00) which alternate with the 
zeros of Jn. Yo and Y; are shown in Figure 5.3. 


yh 
Yo 


=y 


Figure 5.3 Bessel functions Yo and Y}. 
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EXERCISES 


5.17 


5.18 


5.19 


5.20 


5.21 
5.22 


5.23 


5.24 


5.25 


Show that the function 


E aes [ Jg“ 


satisfies Bessel’s equation and is linearly independent of Jn (£). 


Verify the asymptotic behaviour of Yo and Y; near x = 0 as expressed 
by (5.17). 


Prove that 


Prove that 


ge Y) = -x "Y, ,+1 (x). 


Prove that Y_p(x) = (—1)” Y, (x) for all n € No. 


The modified Bessel function of the first kind I, is defined on (0, 00) 
by 
I(x) =i" IL(ia), vy>0, 


where i = /—1. Show that I, satisfies the equation 


gy" + cy! — (2? + v7)y = 0. (5.18) 


Based on the definition of J, in Exercise 5.22, show that J, is a real 
function represented by the series 


Co 


i An) = 2, AEE 1) (G. 


Prove that I,(x) #0 for any x > 0, and that I-n(£) = In(£x) for all 
nEeN. 


Show that the modified Bessel function of the second kind 
T 
[I_L(x) — I (x)] 


also satisfies Equation (5.18). Jo and Ko are shown in Figure 5.4. 


K, (2) 


~ sin VT 


5.4 Integral Forms of the Bessel Function Jn 171 


Figure 5.4 Modified Bessel functions Jj and Ko. 


5.4 Integral Forms of the Bessel Function Jn 


We first prove that the generating function of J, is 


e7(2=1/2)/2 — 5S Jn z £0. (5.19) 


n=— oo 


This can be seen by noting that 


on Say 


! 
j=0 7 
—a/2z _ ( 1) (=) 
g 2 kizk \2/ ? 
=0 


and that these two series are absolutely convergent for all x in R, z 4 0, hence 
their product is the double series 


a(z—-1/z)/2 _ T jtk j—k 
£ = ar r (2) ans 


j=0 k=0 


Setting j — k = n, and recalling that 1/(k + n)! = 1/I (k +n + 1) = 0 when 
k+n <0, we obtain 


sti P eee) 


k=0 


No In(x)z 


n=—Co 


zZ 
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which proves (5.19). The substitution z = et? now gives 


hence 


et sin@ _ 5S Jal jens, (5.20) 


n=— Co 


The function e’”*° is periodic with period 27 and satisfies the conditions 
of Theorem 3.9, thus the right-hand side of (5.20) represents its Fourier series 
expansion in exponential form, and J;,(a) are the Fourier coefficients in the 
expansion. Therefore 


1 See ; 
Jal) = al eit sin 8 ,—ind ag 
o 1 
On 
On 


ele sin 0—n8) q9 
cos(xsin@ — n@)d0, because Jn(x) is real, 
df? : 
=— | cos(x sin 0 — né)dé, n € No, (5.21) 
0 


T 


which is the principal integral representation of Jn. The formula (5.21) imme- 
diately gives an upper bound on Jn, 


1 T 
|Jn(a)| < J d0=1 foralln € No, 
0 


a result we would not have been able to infer directly from the series definition. 
It also confirms that Jo(0) = 1 and J,,(0) = 0 for all n > 1. 


Going back to Equation (5.20) and equating the real and the imaginary 
parts of both sides, we have 


cos(x sin 8) => Jn(x)cos nð, 


n=—Co 


sin(x sin 0) a Jn(x)sin nð. 


n=—Co 
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Because J-n(£) = (—1)"J,(x), this implies 


cos(x sin 6) )+2 = J2m(x)cos 2m9, (5.22) 
m=1 
sin(xsin@) =2 X` Jom—1(x)sin(2m — 1)0. (5.23) 
m=1 


Now Equations (5.22) and (5.23) are, respectively, the Fourier series expansions 
of the even function cos(#sin@) and the odd function sin(x sin 0). Hence we 
arrive at the following pair of integral formulas, 


Jom (x) = =f cos(x sin @)cos 2mé dé, m € No, (5.24) 
Jəm-1(£) = F sin(x sin 0)sin(2m — 1)0 dé, meN. (5.25) 
0 
EXERCISES 


5.26 Show that 
1 1 4 + ie a 
J, (2) = = sin @sin(né — z sin 6)dé, 
T Jo 
then use induction, or Equation (5.20), to prove that 


1 T 
JO) = = i sin’ 8 cos(nð — xsin — kr/2)d0,  keN. 
0 


T 


5.27 Use the result of Exercise 5.26 to prove that 24 (a) < 1 for all 
n,k € No. 


5.28 Prove that 
(a) Jo(z) +25 m-1 J2m(2) = 1. 
(b) Jla) +257 (71) Jom (x) = cos x. 
( 
( 


c) 25 9 OL)” ea) = sin z. 
d) Dp (2m — 1)Jzm-1 (£) = 2/2. 
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5.29 Use Parseval’s relation to prove the identity 


Jo (x) +2 3 Intnl 


n=1 


Observe that this implies |Jo(x)| < 1 and |J (x)| < 1/V2, n € N. 
5.30 Use Equations (5.24) and (5.25) to conclude that 


2 T/2 
Jəm(x) = zy cos(x sin 0)cos 2m0 dð, m € No, 
0 


T 


2 T/2 
Jəm-1(£) = zy sin(x sin 0)sin(2m — 1)0 dé, m EN. 
0 


5.31 Prove that limp—oo Jn(x) = 0 for all x € R. Hint: Use Lemma 3.7. 
5.32 Prove that limz. Jn(x) = 0 for all n € No. 


5.5 Orthogonality Properties 


After division by x Bessel’s equation takes the form 
ze 
xy” +y +(x- N =0, (5.26) 


where the differential operator 


L d d F y? 
= x 
dx \ dz oy 
is formally self-adjoint, with p(x) = x and r(x) = —v?/x in the standard form 
(2.33). Comparison with Equation (2.34) shows that p(x) = x is the weight 


function, but the eigenvalue parameter does not appear explicitly in Equation 
(5.26). We therefore introduce a parameter u through the change of variables 


vr uz, y(x) > y(par) = u(x). 


Differentiating with respect to x, 
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Under this transformation, Equation (5.26) takes the form 


ax 


2 
cu! +u + (ra = 2) u=0, (5.27) 


where the eigenvalue parameter is now \ = yu”. Equations (5.26) and (5.27) are 
equivalent provided u Æ 0. 


If Equation (5.27) is given on the interval (a,b), where 0 < a < b < ov, then 
we can impose the homogeneous, separated boundary conditions 


ayu(a) +agu'(a)=0, — Byu(b) + bzu’ (b) = 0, 


to obtain a regular Sturm-Liouville eigenvalue problem. The eigenfunctions 
then have the form 


Cudv (px) + dpYy (u2), 


where j1,c,, and d, are chosen so that the boundary conditions are satisfied. 
The details are generally quite tedious, and we simplify things by taking a = 0. 

Suppose, therefore, that Equation (5.27) is given on the interval (0, b). 
Because p(0) = 0, no boundary condition is needed at x = 0, except that 
lim, 9+ u(x) exist. At x = b we have 


Byu(b) + Bau (b) =0. (5.28) 


The pair of equations (5.27) and (5.28) now poses a singular SL eigenvalue 
problem and, based on an extension of the theory developed in Chapter 2, it 
has an orthogonal set of solutions which is complete in £2(0, b). For the sake 
of simplicity, we restrict v to the nonnegative integers. This allows us to focus 
on the main features of the theory without being bogged down in nonessential 
details. The assumption v = n € Ng is also the most useful from the point of 
view of physical applications. The general solution of Equation (5.27) is then 
given by 


u(x) = cEnJn( u£) + dnYn (ux). 


The condition that u(x) have a limit at x = 0 forces the coefficient of Y, to 
vanish, and we are left with J;,(j7) as the only admissible solution. 


Let us start with the special case of Equation (5.28) when 3, = 0; that is, 
u(b) =0. (5.29) 
Applying this condition to the solution Jn(ux) gives 


Jn (pb) = 0, n € No. (5.30) 


176 5. Bessel Functions 


We have already determined in Theorem 5.3 that, for each n, the roots of 
Equation (5.30) in (0,00) form an infinite increasing sequence which tends 
to oo, 

Eni < Enz < Sng <0 


The solutions of Equation (5.30) are therefore given by 


Myo = Enk 


and the eigenvalues of Equation (5.27) are 


Note that the first zero épo = 0 of the function Jn, for n > 1, does not 
determine an eigenvalue because the corresponding solution is 


Jnl Hor) = Jn(0) =0 for alln EN, 


which is not an eigenfunction. The sequence of eigenvalues of the system (5.27), 
(5.28) is therefore 


0<\ = pî < À = Uh <3 =U <-, 
and its corresponding sequence of eigenfunctions is 
In (Hy), Jn( H22), Ja(u3x), Bee k 


For each n € No, the sequence (Jp (upr) : k € N) is necessarily orthogonal and 
complete in £2(0, b). In other words 


b 
(alija) ala)) = | In(nj0)Jn(ru)rde (5.31) 


=0 forall j Fk, 


and, for any f € £2(0,b) and any n € No, we can represent f by the Fourier- 
Bessel series z 

(F (2), Ja (Het) « 
AOE 


f(z) = Tn (Mat), (5.32) 
the latter equality being, of course, in £2(0, b). If f is piecewise smooth on (0, b) 
Equation(5.32) holds pointwise as well, provided f(x) is defined as $[f(a*) + 
f(a7)] at the points of discontinuity. 
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To verify the orthogonality relation (5.31) by direct computation is not 
a simple matter, but we can attempt to determine ||Jp(a,2)||. Multiplying 
Equation (5.27) by 2zru’, 
2ru' (xu) + (wea? — v?)2uu! = 


0 
(su)? + (wea? — v*) (uJ! =0 


Integrating this last equation over (0, b), 


u’)? +p’ perai — 2 fy zuda] — ara =0, 


1 

2 

=> lul = E [(uzu)? + (zu)? — v’u’] 
0 


With v = n € No, u(x) = Jn(wx), and u(x) = pd} (ux), p > 0 , we therefore 
have 


b 
jiu f Pindit 
b 


1 
= 32 [(WP 2? — n?) Ja (war) + p’? J? (war)] 


Because n? Jp (0) = 0 for all n € No, 
b2 2 yb? _ n2 
Mn = F GD + EE R), (5.33) 


When u = up the last term drops out, by (5.30), and 


lnla)? = ŽA b). 


Using the result of Exercise 5.9, 


1 
[nJn(ukb) — HkbJn+i(Hpb)] = —Jn+1 (Hkh), 


Jy (1,0) = b 
Hk 


we finally obtain 


b 2 


2 n+1(Hy0)- (5.34) 


lJas); = 


Example 5.5 


To expand the function 


1, O0<a4<2 
0, 2<a<4 
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in a Fourier—Bessel series under the condition Jo(4u) = 0, we first evaluate the 
coefficients in the series: 


(F (2), Jo(4u,2)) „= fs) SA 


-f Jo(ups)ede 
0 
1 2Mp 
=a T Jo(y)ydy 
0 


p? 
2 

= — Ji (24x), 
Hk j 


where we used the result of Example 5.4 in the last equality, 4u, being the 
zeros of Jo. From the formula (5.34) we get 


|| Jo(uun2) lI7 = 8J? (4pt,)- 
Now, by (5.32), 


1s (2m) 
4 k=l My Ji (Abe 


f(z) = j oUz), 0<a<4. 


Observe that, by setting x = 1 in this equation, we arrive at the identity 


— Ji (2up)Jolug) = 


ri HeJ (Ape) ' 


because 1 is a point of continuity of f. At the point of discontinuity x = 2, we 
have 


SS Aan) Jolie) FCH _ 


ez HJI (4u) 2 


The more general condition (5.28) 


B,u(b) + bau (b) = 0 


can, in principle, be handled in a similar fashion, although it requires more 
work. If 6, # 0 and 8, # 0, we can assume, without loss of generality, that 
Bə = 1. The solution u(x) = J,(ux) must then satisfy 


By In (ub) + pJ;, (ub) = 0. (5.35) 
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The positive roots of this equation determine the eigenvalues ju? of the problem. 
The corresponding eigenfunctions are u(x) = Jn(upx). The norm of each 
eigenfunction is determined by substituting into (5.33) to obtain 


b2 2 ped? = n2 
[Inesa = > kb) + Ea Ih nb) 
2 2u 
k 
1 
= gpg (610? + wb? — n°] Jalab). (5.36) 


To solve Equation (5.35) for u is, in general, not a simple matter. We should 
recall that J, and Je cannot have a common zero, because the solution to the 
initial-value problem for Bessel’s equation is unique, so one would most likely 
have to resort to numerical methods to determine ug. But if 64 = 0, that is, if 
the boundary condition at b is 


u' (b) = 0, (5.37) 
then the eigenvalues are determined by the roots of the equation J}, (ub) = 0. 
The first is Uy = 0, corresponding to the eigenvalue 
Ao = 0, 

and the eigenfunction 
Note that this function solves Equation (5.27) but not (5.26). As mentioned 
earlier, these two equations are not equivalent if u = 0. 

With n = 0, the other values of u, must satisfy Ji (ub) = 0 or, equivalently, 
Ji (u,b) = 0. Thus the sequence 


My = ik /b, KEL 25356005 


where €,;, are the positive zeros of Jı, yields the remaining eigenvalues of the 


problem, 
2 
À — Eik 
k= b2 ’ 


corresponding to the eigenfunctions 


keN, 


ur(x) = Jo(up), KEN. 


Under the boundary condition (5.37), if n is a positive integer, then 0 is not 
an eigenvalue, and we obtain the eigenvalues A, = u? by solving 


Tn (Und) = 0. 


The corresponding eigenfunctions are then u(x) = Jn(u,xv). In any case, 
whether n is 0 or positive, the norm of Jn is calculated from the formula (5.36) 
with 6, =0. 
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Example 5.6 
In R? the cylindrical coordinates (r,6,z) are related to the Cartesian coordi- 
nates (x,y,z) by 

x = r cos 0, y= rsin, 


r= syr? +y, 
where 0 < r < œ and —a < 0 < a. The Laplacian operator in cylindrical 
coordinates has the form 
aoe ala ge ED a 
=- ðr? r Or r2?əð Oz?" 


Given the cylindrical region 
N = {(r,0,z): 0 <r <b, -r <0 <T, 0<z<hy}, 
we seek a potential function u which satisfies Laplace’s equation 
Au = urr +r tur + 299 + uz =0 


in 2, and assumes the following values on the boundary of 92: 


u(r, 6,0) = 0, 0<r<b, —1t<0<7, 
u(b, 6, z) = 0, -T <0<T, O<z<h, 
u(r, 0,h) = f(r), 0<r<b, —1<O< 7. 


This is often referred to as a Dirichlet problem in 22. 

The general Dirichlet problem is to determine the function u in a domain 
N which satisfies Laplace’s equation Au = 0 in 2 and u = f on the boundary 
ðN, where f is a given function defined on 02. When ðN and f satisfy certain 
regularity conditions, the problem is known to have a unique solution in O?() 
(see [13]). 


Solution 


Because u is independent of 0 on the boundary, we can assume, by symmetry, 
that it is also independent of 0 inside 2. Using separation of variables, let 


u(r, z) = v(r)w(z), O0<r<b, 
and substitute into Laplace’s equation to obtain 


v" (r)w(z) +r (r)w(z) + v(r)w” (z) = 0. 
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This leads to the pair of equations 


r2y"(r) + ro'(r) + p?r2u(r) = 0 
ad” (a) — (2) = 0, 


where the separation constant was assumed to be —y?. Equation (5.38) is 
Bessel’s equation with v = 0, whose continuous solution in 0 < r < bis Jo(ur). 
Under the boundary condition u(b) = 0, the eigenfunctions are Jo(j,7) corre- 
sponding to the eigenvalues Ap = u? = (E9;,,/b)?. As before, £91, £92; Eog.. 
are the positive zeros of Jo. 

The corresponding eigenfunctions of Equation (5.39) under the condition 
w(0) = 0 are sinh(y,z). Hence the sequence of eigenfunctions of the original 
problem is 

un(r, z) = Jolugr)sinh(ugz), KEN. 


Before applying the nonhomogeneous boundary condition at z = h, we form 
the more general solution 


u(r, z) = 5o CkUk (r, z). 
k=1 
Now the condition u(r, h) = f(r) implies 


f(r) = $ cxsinh(uph) Jo (ugr) , (5.40) 
k=1 


which is the Fourier—Bessel series expansion of the function f, assuming of 
course that f is piecewise smooth on (0,0). The coefficients in the expansion 
are given by 

1 

cxsinh(yigh) = a | FOr) dr, 
I|Jo(unr) II J0 

and this completely determines c; for all positive integers k. Bessel functions 
usually appear in the solution of Laplace’s equation when cylindrical coordi- 
nates are used, hence they are often referred to as cylinder functions. 


EXERCISES 


5.33 Determine the Fourier—Bessel series expansion )> ckJo(ugz) on (0, b], 
where uy are the positive roots of the equation Jo(ub) = 0, for each 
of the following functions. 


(a) f(@)=1 
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5.34 


5.35 


5.36 


5.37 


5.38 


5.39 


5.40 
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(b) f(a) =« 
(c) f(«) = 2? 
(d) f(x) =b? — 2? 
1, 0<a<b/2 
(e) f(a) = 4 1/2, a = b/2 
0, b/2 <x <b. 


Expand the function f(x) = 1 on [0,1] in terms of Jo(upx), where 
Up are the nonnegative zeros of Jj. 

Expand f(z) = x on [0,1] in terms of Jı (upr), where py, are the 
positive zeros of Jj. 

For any positive integer n, expand f(x) = x” on [0,1] in terms of 
In(U,2), where ju, are the positive zeros of J}. 


Determine the coefficients in the series X` cp Jı (upr) which represents 


the function eee 
a <a< 
ra=] 0, 1<r<2 
on [0,2], where ju, are the positive zeros of Ji (2u). Is the represen- 
tation pointwise on [0,2]? 
Show that 0 is an eigenvalue of Bessel’s equation (5.27) subject to 
the boundary condition (5.28) if, and only if, 81/82 = —v/b, and 
that the corresponding eigenfunction is 7”, where v > 0. 
The heat equation on a circular flat plate is given in polar coordinates 
(r, 0) by 
1 1 
u= k (i + -ur + zuo) ; 
r r 
Suppose that the temperature u = u(r, t) does not depend on @ and 
that 0 < r < 1. If the edge of the plate is held at zero temperature for 


all t > 0, use separation of variables to show that the temperature 
on the plate is given by 


u(r, t) = 5 cne at Jof uar) for allr € [0, 1), t>0, (5.41) 


n=1 
where u, are the positive zeros of Jo. 
If the initial temperature on the plate in Exercise 5.39 is u(r,0) = 


f(r), prove that the Fourier—Bessel coefficients in (5.41) are given 
by 


2 1 
cn = FG) [fo 0lir\rar 
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5.41 A thin elastic circular membrane vibrates transversally according to 
the wave equation 


1 
uu = 2 (tre + Eur), O0<r<R,t>0. 
r 


If the boundary condition is u(R,t) = 0 for all t > 0, and the initial 
conditions are 


u(r,0) = f(r), u(r, 0) = g(r), 0<r<R, 


determine the form of the bounded solution u(r, t) in terms of J, for 
all r € [0, R) andt > 0. 
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The Fourier Transformation 


The underlying theme of the previous chapters was the Sturm—Liouville theory. 
The last three chapters show how the eigenfunctions of various SL problems 
serve as bases for £L?, either through conventional Fourier series or its general- 
ized version. In this chapter we introduce the Fourier integral as a limiting case 
of the classical Fourier series, and show how it serves, under certain conditions, 
as a method for representing nonperiodic functions on R where the series ap- 
proach does not apply. This chapter and the next are therefore concerned with 
extending the theory of Fourier series to nonperiodic functions. 


6.1 The Fourier Transform 


Suppose f : R —> C is an £? function. Its restriction to (—l, l) clearly lies in 
£?(-1,1) for any 1 > 0. On the interval (—I,/) we can always represent f by 
the Fourier series 


f(z) = y epe Trel (6.1) 


n=— oo 


1 f j 
Cn = af f(a) lde, neZ. (6.2) 
=f 
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Let A£ = m/l and €, = nA€E = nr/l. The pair of equations (6.1) and (6.2) 
then take the form 


2) = 5 DCE, et" Ag, (6.3) 


l 
Chén) = 2lcn = if f(a) Eda. (6.4) 


If we now let l — oo, that is, if we allow the period (—1,/) to increase to R so 
that f loses its periodicity, then the discrete variable €,, will behave more as a 
real variable €, and the formula (6.4) will tend to the form 


gaJ” f(a)e" 8" da. (6.5) 


The right-hand side of (6.3), on the other hand, looks very much like a Riemann 
sum which, in the limit as | — oo, approaches the integral 


fla) =f CO (6.6) 


Thus the Fourier coefficients c, are transformed to the function C(£), the 
Fourier transform of f, and the Fourier series (6.1), which represents f on 
(—1,1), is replaced by the Fourier integral (6.6) which, presumably, represents 
the function f on (—oo, 00). 

The procedure described above is, of course, not intended to be a “proof” 
of the validity of the formulas (6.5) and (6.6). The integral in (6.5) may not 
even exist. It is meant to be a plausible argument for motivating the definition 
(to follow) of the Fourier transform (6.5), which can then be used to represent 
the (nonperiodic) function f by the integral (6.6), in much the same way that 
the Fourier series were used in Chapter 3 to represent periodic functions. 

For any real interval J, we use the symbol £1(I) to denote the set of func- 


tions f : I — C such that 
firo dz < oo. 
I 


Thus, if I is a bounded interval, any integrable function on J, and in particular 
any piecewise continuous function, belongs to £1(I). Furthermore, 


xr“ €£'(0,1)6a>-1, 


r“ €L£' (1,00) Sa < —1. 
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If I is unbounded, a function may be integrable (in the improper sense) without 
belonging to L! (I), such as sin x/zx over (0,00) (see Exercise 1.44). That is why 
we refer to £1(I) functions as absolutely integrable functions on J in order to 
avoid this confusion when J is unbounded. As with £?(J), it is a simple matter 
to check that £! (T) is also a linear space. 


Definition 6.1 


For any f € £1(R) we define the Fourier transform of f as the function f : 
R — C defined by the improper integral 


Kos Oea (6.7 
We also use the symbol F(f) instead of f to denote the Fourier transform of 


f. 


Because |e'$*| = 1 we clearly have 


ls [le lar < o: 


that is, f is a bounded function on R. By the linearity of the integral, 
Fla fit c2f2) =aF(fi) + cF (fa) 


for all c1,¢c2 € C and all fi, f2 € L!(R), which just means that the Fourier 
transformation F : f — f is linear. 


Example 6.2 


For any positive constant a, let 


Then 


as shown in Figure 6.1. Note that lima—oo fa(€) does not exist and that f(x) = 1 
does not lie in £1(R). 
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Figure 6.2 


Example 6.3 


In the case of the function f(x) = e7!*! we have (see Figure 6.2) 


0 oo 
FE =d eat | ete "dz 
0 


o 1 1 
7 ine De 
o 2 
Sro 


When |f| is integrable over R (i.e., when f € £1(IR)) we have seen that its 
Fourier transform f is bounded, but we can also prove that f is continuous. This 
result relies on a well-known theorem of real analysis, the Lebesgue dominated 
convergence theorem, which states the following. 


Theorem 6.4 


Let (fn : n € N) be a sequence of functions in £!(I), where T is a real interval, 
and suppose fn — f pointwise on I. If there is a positive function g € L*(1) 
such that 
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lfn(x)| < g(x) for alla € I, neéN, 
then f € £1(I) and 


lim | fn(x)dx = [ fae. 
The proof of this theorem may be found in [1] or [14], where it is presented 
within the context of Lebesgue integration, but it is equally valid when the 
integrals are interpreted as Riemann integrals. Note that the interval J is not 
assumed to be finite. 
To prove that f is continuous, let € be any real number and suppose (€,,) 
is a sequence which converges to €. Since 


OR fo] s fo let | lsat, 


[emne — e "| [f(z)| < 2|f(a)| € L'(R), 


and 
lim le~*n# — | = 0, 
Theorem 6.4 implies 
Jim [AE - FQ] < lim f Jet - "| [f(a] de 


= if lim Je~%n# — emise] |f (x)| dx 


=0. 
In order to study the behaviour of f(E) as |E] > 00, we need the following 


result, often referred to as the Riemann—Lebesgue lemma. 


Lemma 6.5 


Let f be a piecewise smooth function on R. 
(i) If [a,b] is a bounded interval, then 


b 
ae / f(x)e*dx = 0. 
(ii) If f € £1(R), then 


lim i f(a)es*dx = 0. 


|| 00 
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Proof 


(i) Let £1, £2, . . . , £n be the points of discontinuity of f and f’ in (a,b), arranged 
in increasing order, and let a = zo and b = %y41. We then have 


Tk+1 


1 l f(a)e®* de = > f , f(a)e®* dz, 


and it suffices to prove that 


Tk+1 ; 
lim f(x)es*dx = 0 for all k. 


|E| 00 LE 


Integrating by parts, 


Tk+1 , 1 : Tk+1 1 Tk+1 i 
[T toei taes f'(aeax, 
x i} 


k 


and the right-hand side of this equation clearly tends to 0 as é — +00. 
(ii) Let £ be any positive number. Because |f| is integrable on (—00, 00), we 
know that there is a positive number L such that 


€ 
< J, __ Old < a 


But, from part (i), we also know that there is a positive number K such that 


E fojea- f f(ae®* dx 


< - for all |€| > K. 


| a : f(æ)je da 


Therefore, if |£| > K, then Kees f(a)et§*da| <E. 


We have therefore proved the following theorem. 


Theorem 6.6 


For any f € L! (R), the Fourier transform 


fla) =f sleje*ar 


is a bounded continuous function on R. If, furthermore, f is piecewise smooth, 
then 
lim f(€) =0. (6.8) 


|| 00 
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Remark 6.7 


1. Lemma 6.5 clearly remains valid if e8” is replaced by either cos Ex or sin êz. 
2. The Riemann—Lebesgue lemma actually holds under the weaker condition 
that |f| is merely integrable, but the proof requires a little more work (see [16] 
for example). In any case we do not need this more general result, because 
Lemma 6.5 is used in situations (such as the proof of Theorem 6.10) where f 
is assumed to be piecewise smooth. 

3. In view of the above remark, Equation (6.8) holds for any f € £1(R) without 
assuming piecewise smoothness. 


We indicated in our heuristic introduction to this chapter that the Fourier 
transform f , denoted C in (6.5), plays the role of the Fourier coefficients of the 
periodic function f in the limit as the function loses its periodicity. Hence the 
asymptotic behaviour f(€) > 0 as € > too is in line with the behaviour of the 
Fourier coefficients cn when n — +oo. But although Theorem 6.6 states some 
basic properties of f , it says nothing about its role in the representation of f 
as suggested by (6.6), namely the validity of the formula 


fla) =f fleas. (6.9) 


The right-hand side of this equation is called the Fourier integral of f. This 
integral may not exist, even if the function f(€) tends to 0 as |€| > oo, unless 
it tends to 0 fast enough. Furthermore, even if the Fourier integral exists, the 
equality (6.9) may not hold pointwise in R. This is the subject of the next 
section. 


EXERCISES 


6.1 Determine the Fourier transform of each of the following functions. 


1—|a|, |2|<1 


ees. 


cosg, |r| <7 


CO 


1, O<a<il 
(c) f(z) -f 0, otherwise. 
6.2 Given any f : I — C, prove that 
(a) If I is bounded and f € £L?(I), then f € L!(I), 


(b) If f is bounded and f € L!(I), then f € £?(I). 
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6.4 


6.5 


6.6 


6.7 


6.8 
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Let y: I x J — C, where I and J are real intervals, and suppose 
that, for each x € I, y(a,-) is a continuous function on J. If y(-, €) 
is integrable on J for each € € J, and there is a positive function 
g € L*(I) such that |y(x,£)| < g(x) for all x € I and € € J, 
use the dominated convergence theorem to prove that the function 
F(€) = f p(x, €)dz is continuous on J. 


Under the hypothesis of Exercise 6.3, if y(a,-) is piecewise continu- 
ous on J, prove that F is also piecewise continuous on J. 


If the function y(z,-) in Exercise 6.3 is differentiable on J and 
|ve(x,-)| < h(x) for some h € L'(I), prove that F is differentiable 
and that its derivative F’(€) equals {,y¢(2, €)dx. If ye is continuous 
on J, prove that F” is also continuous on J. Hint: For any € € J, 


Ppl, E) = (En — ET lE, En) — plr, €)] > pele, £) as En > £. Use 


the mean value theorem to conclude that |Y, (x, E€)| < |h(x)| on I xJ. 


Using the equality 
2? 1 
J e Sdx== forall £> 0, 
0 € 
show that, for any positive number a, 


°F n! 
I OPES a E>a, nEN. 
0 


Note that if we set € = 1 in this last equation, we arrive at the 
relation n! = I (n + 1). 


Use Exercises 6.3 and 6.5 to deduce that the gamma function 
I(é) = e ? Sl da 
0 


is continuous on [a,b] for any 0 < a < b < ow, and that its nth 
derivative 


co d” 
PM (ë) = | e *® — (x51) dr 
Gian ee) 
is continuous on [a,b]. Conclude from this that I € C% (0, 00). 


Use Lemma 6.5 to evaluate the following limits, where Dp is the 
Dirichlet kernel defined in Section 3.2. 
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m/2 
) limno J Dn(x)dx 
n/6 


6.9 Let f and g be piecewise smooth functions on (a,b), and suppose 
that £1,..., £n are their points of discontinuity. Prove the following 
generalization of the formula for integration by parts. 


Aol Paige eee) eC Cra 


6.2 The Fourier Integral 


The main result of this section is Theorem 6.10, which establishes the inversion 
formula for the Fourier transform. The proof of the theorem relies on evaluating 


the improper integral 
if ® sing 
dx, 
0 x 


which is known as Dirichlet’s integral. To show that this integral exists, we 


write i í 
| ee de = | alee + lim ome dee. (6.10) 
0 0 


£ x b> Jy x 


Because the function sinz/x is continuous and bounded on (0,1], where it 
satisfies 0 < sina/a < 1, the first integral on the right-hand side of (6.10) 
exists. Using integration by parts in the second integral, 


? sina cos b ? cosg 
dx = cos 1 z dr, 
i, i2, b 1 T 


ee cose a ier 


and noting that 
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yA 


=T 


Figure 6.3 


we see that limp ox J (cos a/a*)dx exists. Hence the integral [P (sin x/x)dx 
is convergent (see Exercise 1.44 for an alternative approach). The integrand 
sin z/a is shown graphically in Figure 6.3. 

Now that we know Dirichlet’s integral exists, it remains to determine its 
value. 


Lemma 6.8 

f MT de =, (6.11) 

0 x 2 
Proof 
Define the function 
: O0<a< 
—— ; EST 
f(z) = x 2sin ix 
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It is a simple matter to check that f and its derivative are both continuous on 
(0, 7]. By Lemma 6.5, 


lim a f(ax)sin a dx = 0. (6.12) 


E00 0 


Therefore 


= lim = dx by (6.12 
¿>œ 2 Jo sin iz ¥ ( ) 
. 1 f™sin(n+ 5 

gees 
nc 2 Jo sin 52 


Going back to the definition of the Dirichlet kernel 
Dy (x) = oa 5 gue 
n on 3 
k=—n 
and using Lemma 3.8 to write 


A 1 
A aa 


2m sin ir 


I snes = lim rf Dy( 
0 £ n—0o ~ 9? 


by Equation (3.17). 


we conclude that 


Although the function 
sin éx 


£ 
is continuous with respect to both x > 0 and € € R, the function defined by 


the improper integral 
K(€) = i singe gy 
0 T 


is not continuous at € = 0, for 


K(g)= 4 0, &=0 
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This implies that the function |sin€2/z| is not dominated (bounded) by an 
£1(0,00) function, which clearly follows from the fact that |sinz€/z| is not 
integrable on (0, co) (see Exercise 1.44). The situation we have here is analogous 
to the convergence of the Fourier series 


S sin n 
n 


n=1 


to a discontinuous function because its convergence is not uniform. Based on 
this analogy, the improper integral 


F= f oe.s)ae 


is said to be uniformly convergent on the interval I if, given any € > 0, there 
is a number N > a such that 
| f e €)dx 


The number N depends on € and is independent of £. 
Corresponding to the Weierstrass M-test for uniform convergence of series, 


b>N > 


rO- fe (a, €)da} = 


ZE for all € € I. 


we have the following test for the uniform convergence of improper integrals. 
We leave the proof as an exercise. 


Lemma 6.9 


Let y: [a,oo) x I — C, and suppose that there is a function g € L1(a, oo) such 
that |y(x,€)| < g(x) for all € in the interval J. Then the integral f° p(x, €)dzx 
is uniformly convergent on I. 


If a function y(x,&) satisfies the conditions of Lemma 6.9 with I = fa, 8] 


and if, in addition, y(x,-) is continuous on [a, 8] for each x € [a, o0), then the 
function 


is also continuous on |a, 8] (Exercise 6.3), and satisfies 


[re \dé = ee [ve o)déde. (6.13) 


This follows from the observation that the uniform convergence 


b 
J eeddr FE as b> 
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implies that, for every £ > 0, there is an N > 0 such that 


zi glx)dx < € 
b 


> |f rox- f f oeda dé — [ [ve é)dedé 
Seah) 


In other words, under the hypothesis of Lemma 6.9, we can change the order 
of integration in the double integral 


T [ee patax= f [ e €)dardé. 


This equality remains valid in the limit as 8 — oo provided F is integrable on 


[a, co). 

Pushing the analogy with Fourier series to its logical conclusion, we should 
expect to be able to reconstruct an £! function f solely from knowledge of its 
transform f , in the same way that a periodic function is determined (according 
to Theorem 3.9) by its Fourier coefficients. Based on Remark 3.10 and Equation 
(6.6), we are tempted to write 


ra= f Aoa 


assuming that x is a point of continuity of f, and we would not be too far off the 
mark. The only problem is that, although continuous and bounded, | if 


may not 
be integrable on (—oo, co), so that the above integral may not converge. Some 
treatments introduce a damping function, such as ene / 2 into the integrand 
to force convergence, and then take the limit of the resulting integral as € — 0. 
Here we introduce a cut-off function, already suggested by the Fourier series 


representation 
N 


f(z) = lim 5 Cne”, (6.14) 


N-co 
n=—N 


which is further elaborated on in Remark 6.11. 
Corresponding to Theorem 3.9 for Fourier series, we have the following 
fundamental theorem for representing an £L! function by a Fourier integral. 
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Theorem 6.10 
Let f be a piecewise smooth function in £'(R). If 

= | f(xje dr, EER, (6.15) 
then 


i xf i f@eae = Fife) +E] foralreR (616) 


Loo 27 
Before attempting to prove this theorem, it is worthwhile to consider some 


important observations on the meaning and implications of Equation (6.16). 


Remark 6.11 
1. The limit 
eS Jer’ 
Jim fei dé (6.17) 


is the Cauchy principal value of the improper integral 
1 E 
=| F(E EdE = iim tim 55 f AO fE edt. (6.18) 


The restricted limit (6.17), as is well known, may exist even when the more 
general limit (6.18) does not. If f lies in £!(R) then, of course, the two limits 
are equal. 

2. If f is defined by 


f(a) = ZE) + Fe) (6.19) 


at every point of discontinuity x, then (6.16) becomes 


f(a) = lim a: fede = F(A), 
the inverse Fourier transform of f , or the Fourier integral of f : 


3. When f € £!(R) and (6.19) holds, Equations (6.15) and (6.16) define the 
transform pair 


f= FINO = f * f(a)eni® de, 
HoF A= xf fear, 
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a representation which exhibits a high degree of symmetry. Some books adopt 


the definition i E 
fA Bet ey x£ —ix& 
MO = | e*Eda, 


1 ary ix 
He) = af Oa, 


and thereby achieve complete symmetry. Our definition is a natural develop- 
ment of the notation used in Fourier series. 

4. If f = 0 then (6.16) implies f = 0, assuming of course that Equation (6.19) is 
valid. This means that the Fourier transformation F, defined on the piecewise 
smooth functions in £1(R), is injective. 


from which 


Proof 


/ i iors] i | f 7 f) ay elt dg 


zi. Í f. i fu) E- ddy, 


where we used Equation (6.13) to change the order of integration with respect 
to y and £, because the function 


NO 


satisfies the conditions of Lemma 6.9. Integrating with respect to €, we obtain 
L DA 
E sin L(x — y 
f Fosas f SE yay 
=L =o 


=2f He + nan, (6.20) 


Suppose now that ô is an arbitrary positive number. As a function of 
n, f(a + 7)/n is piecewise smooth and its absolute value is integrable on 
(—oo, —ô] U [6, 00), so we can use Lemma 6.5 to conclude that 


in L 
om te + n)dn = 0. 


lim 
L> Jiz 1 


Taking the limits of both sides of Equation (6.20) as L — oo, we therefore have 


L-oo 


L Sd 
lim J f(E dE = 2 lim Í y +n)dn 
-L Loo J_§ n 


ô in 
=2 lim | sin (a +n) + fle ndn: 
0 n 


I-00 
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Now Lemma 6.5 implies 


lim sin Ly 
Loo Jo 


and Lemma 6.8 gives 


hence 

jim, f mA F(a + n)dn = gfe") 
Similarly, 

jm, > n -ndn = 3f) 
Consequently, 


L—oœ 


L 
lim J F(E EdE = a[f(at) + f(e), 
BS 


and we obtain the desired equality after dividing by 27. 


The similarity between Theorems 3.9 and 6.10 is quite striking, although it 
is somewhat obscured by the fact that the trigonometric form of the Fourier 
series was used in the statement and proof of Theorem 3.9, whereas Theorem 
6.10 is expressed in terms of the exponential form of the Fourier transform and 
integral. The correspondence becomes clearer when the formulas in Theorem 
3.9 are cast in the exponential form 


Cn = ~ 3 f(aje Sde 
N 1 P E 
da, D eve = E EEO 


On the other hand, the trigonometric form of Theorem 6.10 is easily derived 
from Euler’s relation et? = cos 0+isin 0. Suppose f € £L!(R) is a real, piecewise 
smooth function which satisfies 


[f(@*) + f(x) 
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at each point of discontinuity. Such a function has a Fourier transform 
A o9 . 
fle)= f a 
—Cco 


= f : f(x)cos Exda — i / 7 f(x)sin éx dx 


= A(é) — iB(é), (6.21) 

where 
A(€) = a f(x)cos&x dz, (6.22) 
B(é) = f. f(§)sin€a dz, EER, (6.23) 


are the Fourier cosine and the Fourier sine transforms of f, respectively. 
From Theorem 6.10 we can now express f(x) as 


L 
fe) = jim zs | AO -iBEE d, 
Because A(€) is even and B(£) is odd, we have 


L 
PC Tes J [A(E)cos x€ + B(E)sin x€]dé 


-L 


L 
= lim -{ [A(é)cos rE + B(€)sin xé]dé 
0 


= [ ees xE + B(E)sinxé]d€, «eR. (6.24) 
T Jo 


If f is even, then (6.23) implies B(€) = 0 and the transform pair (6.22) and 
(6.24) take the form 


A(é) = 2 f e (6.25) 

f(x) = : | NOREA (6.26) 
If f is odd, then A(£) = 0 and we obtain 

B(é) =2 f "oei, (6.27) 

f(x) = L 1 * BUONE, (6.28) 


The attentive reader will not miss the analogy between these formulas and 
the corresponding formulas for an and bn in the trigonometric Fourier series 
representation of a periodic function. 
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Example 6.12 


Applying Theorem 6.10 to Example 6.2, we see that 


l a = 1 f la Ecos zé dé 
pa = “aie E ae ese Seige COS T 
2 [®1 
= F — sin a£ cos x dé 
TJo £ 


because sin a€/€ is an even function and cos €z is the even (real) part of etS7. 
Therefore, recalling the definition of fa, we arrive at the following interesting 
evaluation of the above integral, 


0, x < —a 
9 f>] 1/2, z= —a 
F - sina cos xé d = ¢ 1, —a<xr<a 
mJo & 1/2, vr=a 

0, r>a. 


The fact that the right-hand side of this equality is not a continuous function 
implies that the integral in the left-hand side does not converge uniformly. 


Example 6.13 


The function 


fa) ={ sin x, |x| <r 


0, |x| > 7 


is odd. Hence its cosine and sine transforms are, respectively, 


and 


W 
SEN 
In 
NS 

II 


T f(x)sin €x dx 


= 2 | sinxsin€x dx 
0 
1 
Pe 
_ 2sin 7€ 
To. 
f(x) and B(€) are shown in Figure 6.4. 


Il 


sin(1 — €)7 


= x 
IFE sin(1 + E)r 
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Figure 6.4 f and its sine transform. 


By (6.28) the Fourier integral representation of f is therefore 


2 (~sin7ré . 
f(x) = a -ë sin zé d£. (6.29) 
Note here that ; 
lim = me sin zÊ = +" sin x; 
€é—-+1] — E 2 


so the integrand (1 — ¿°)-tsin mêsin z is bounded on 0 < € < oo, and is 
dominated by c/é* as £ > oo for some positive constant c. The Fourier integral 
(6.29) therefore converges uniformly on R to the continuous function f. 


Example 6.14 


We have already seen in Example 6.3 that the Fourier transform of the even 


function e7!*! is 
2 


1+ €’ 


Pegs 


which is an £! function on R, hence, using (6.26), 


2 CO 
eo lal — z Conte ie 
TJo Leg 


At x = 0 we obtain the familiar result 


a a = F 


We conclude this section with a brief look at the Fourier transformation in 
L?, the space in which the theory of Fourier series was developed in Chapter 
3. Exercise 6.2 shows that, in general, there is no inclusion relation between 
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£'(R) and £?(R), but bounded functions in £'(R) belong to £L?(R), because 
f If? < M f |f| whenever |f| < M. Suppose that both f and g are piecewise 
smooth functions in L! (R), and hence bounded, and that their transforms f and 
g also belong to £1(R). According to Theorem 6.6, the functions f and ĝ are 
continuous and bounded on R. The improper integrals 


co co 
f Poea f aeea 
= —oo 
are also piecewise smooth on R, for they represent the functions 27 f(x) and 
27g(x), respectively, by Theorem 6.10. Here, of course, we are assuming that f 
and g are defined at each point of discontinuity by the median of the “jump” 


at that point. Thus the functions f, g, f, and ĝ all belong to £2(R), and we 
can write 


2r (fo) = 20 | Heede 
=| f 1k 
= A j J : f(x)e®"9(@dadé 
TEO 
= (Fa) . (6.30) 


When g = f, we obtain a relation between the £? norms of f and its Fourier 
transform, 


JA = 2x11, (6.31) 


which corresponds to Parseval’s relation (1.23). Equations (6.30) and (6.31) 
together constitute what is known as Plancherel’s theorem, which actually holds 
under weaker conditions on f and g. In fact, this pair of equations is valid 
whenever f and g lie in £?(R). The proof of this more general result is based 
on the fact that the set of functions £7(R) N £1(R) is dense in £?(R), that is, 
that every function in £?(R) is the limit (in £?) of a sequence of functions in 
L? (R) N LER). 


EXERCISES 


6.10 Express each of the following functions as a Fourier integral in 
trigonometric form: 
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_ f [sinz], lal <r 
@ saf F ee 
1, 0<ar<1 
o 1) ={ 9) x<0, ¢>1 
1-2, O0<a<l 
(c) f(~)=4 -l-«a, -l<a<0 
0, |x| > 1. 
cos T, 0< z< T/2 
(d) f(£)= 4 —cosz, —r/2<£ <0 
0, |z| > 7/2. 
cos T, O<T<T 
@ re={ 5 x<0, T>. 


6.11 Use the result of Exercise 6.10(e) to show that 
œ €sin mé 
— 2 dg 


6.12 Prove that 


enor n É =f ene ie for alle > 0, a>0. 
T Eta 


Explain why the equality does not hold at x = 0. 
6.13 Prove that 


a Ea = = 5e cosg for all x > 0. 


Is the integral uniformly convergent on [0, 00)? 


6.14 Determine the Fourier cosine integral for e~* cosx, x > 0. Is the 
representation pointwise? 


6.15 Prove that 


oo 1/2 O<a<T 
1 = 3 
J SSi t naéd€ = 4 7/4, T= 
0 


é 0, L>T 


6.16 Determine the Fourier transform of f(x) = (1 + x?)™t, z € R. 
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6.17 Given el | 
1—|za|, a|<1 
fe) ={ 4 In| > 1, 
show that 
ARTER ae 
fg = [SP] 
then conclude that 
© gin? € 
/ 2 dE =T. 


n12 
6.18 Verify the equality |A| = 2r ||f||? in the case where f(x) = e7!*I, 


6.19 Express the relation (6.31) in terms of the cosine and sine transforms 
A and B. 


6.3 Properties and Applications 


The following theorem gives the fundamental properties of the Fourier transfor- 
mation under differentiation. The formula for the transform of the derivative is 
particularly important, not only as a tool for solving linear partial differential 
equations, but also as a fundamental result on which the existence of solutions 
to such equations is based. 


Theorem 6.15 


Let f € £'(R). 
(i) If f’ € £1(R) and f is continuous on R, then 


FF V(E) = iF (FE), EER. (6.32) 
(ii) If f(x) € £1(R), then F(f) is differentiable and its derivative 
EFEO =F if) EER, (6.33) 


is continuous on R. 


Proof 


(i) |f’| being integrable, its Fourier transform F(f’) exists and 


Frye) = [ ~” Pae da. 
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The continuity of f allows us to write fọ f’(t)dt = f(x) — f(0), hence the two 
limits 


lim f(x) = f(0) + lim "p(at 


Te got 
0 


exist. But because f is continuous and integrable on R, lims—+ə f(x) = 
Now integration by parts yields 


FENO = Hee, Hie f7 Hoea 


= iF (f)(€). 
(ii) 
Ter AS) - FO _ F pee 
Ag et ee f(x) Ze dz. 
In the limit as AE — 0, we obtain 
d í —i(€+AE)x e tga P 
ae? PK = 9= jim, f f(z AE hee 
By hypothesis, the limit 
—i(E+AE)x _ pix ; 
lim f(x) : = —ixf(x)e—** 


AE—0 AE 


is dominated by |xf(x)| € £1(R) for every £ € R, therefore we can use Theorem 
6.4 to conclude that 


SPN = | Histol ae = Fl - irf). 


Using induction, this result can easily be generalized. 


Corollary 6.16 


Suppose f € L1(R) and n is any positive integer. 
(i) If f) € L'(R) for all 1 < k <n, and f(~” is continuous on R, then 
FNS = i)" F (FYE), (6.34) 


(ii) If x” f(x) € L'(R), then 


FDO = Fil — iz)” f8). (6.35) 
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The integrability of |x” f(a)| on R may be viewed as a measure of how fast 
the function f(x) tends to 0 as z — oo, in the sense that f(x) tends to 0 
faster when n is larger. Equation (6.34) therefore implies that, as the order of 
differentiability (or smoothness) of f increases, so does the rate of decay of f : 
Equation (6.35), on the other hand, indicates that functions which decay faster 
have smoother transforms. 


Example 6.17 


Let 
f(x) = ae zceER. 


To determine the Fourier transform of f, we recall that 


i 7 e? dr = yT (6.36) 


(Exercise 4.15). Because f satisfies the conditions of Theorem 6.15, Equation 
(6.33) allows us to write 


$ f(€) = —i D ret etde 


The solution of this equation is 
FE = ett, 


where the integration constant is determined by setting € = 0 and using Equa- 
tion (6.36); that is, 

c= f(0) = vr. 
Thus F(e™® )(€) = Yre- 4, 


6.3.1 Heat Transfer in an Infinite Bar 


Just as Fourier series served us in the construction of solutions to boundary- 
value problems in bounded space domains, we now show how such solutions 
can be represented by Fourier integrals when the space domain becomes un- 
bounded. The question of the uniqueness of a solution obtained in this manner, 
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in general, is not addressed here, as it properly belongs to the theory of par- 
tial differential equations. But the equations that we have already introduced 
(Laplace’s equation, the heat equation, and the wave equation) all have unique 
solutions under the boundary conditions imposed, whether the space variable 
is bounded or not. This follows from the fact that, due to the linearity of these 
equations and that of their boundary conditions, the difference between any 
two solutions of a problem satisfies a homogeneous differential equation un- 
der homogeneous boundary conditions, which can only have a trivial solution 
(see [13]). 


Example 6.18 


Suppose that an infinite thin bar has an initial temperature distribution along 
its length given by f(x). We wish to determine the temperature u(x,t) along 
the bar for all t > 0. To solve this problem by the Fourier transform, we assume 
that f is piecewise smooth and that |f| is integrable on (—o0, 00). 

The temperature u(x,t) satisfies the heat equation 


Ut=kter, “r< t>0, (6.37) 
and the initial condition 
u(z,0) = f(a), —œ < z< œ. (6.38) 
As before, we resort to separation of variables by assuming that 
u(x,t) = v(x) w(t). 
Substituting into Equation (6.37), we obtain the equation 


vu" (a) = 1 w(t) 
ole)  kw(t)’ 


wo<r<ow, t>0, 


which implies that each side must be a constant, say —A”. The resulting pair 
of equations leads to the solutions 

v(x“) = A(A)cos Az + B(A)sin Ax, 

w(t) = CA), 
where A, B, and C are constants of integration which depend on the parameter 
A. Since there are no boundary conditions on the solution, À will be a real 
(rather than a discrete) variable, and A = A(X) and B = B(A) will therefore be 


functions of A, and we can set C(A) = 1 in the product v(x)w(t). Corresponding 
to each À € R, the function 


uy(zx, t) = [A(A)cos Ax + B(A)sin Az]e7”! 
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therefore satisfies Equation (6.37), A(A) and B(A) being arbitrary functions of 
à € R. We do not lose any generality by assuming that À > 0 because the 
negative values of À do not generate additional solutions. 

u(x,t) cannot be expected to satisfy the initial condition (6.38) for a gen- 
eral function f, so we assume that the desired solution has the form 


u(x,t) = Fi uy(a, t)dr 


T 


N 


= F [A(A)cos Ax + B(A)sin Azle" dA. (6.39) 
0 


This assumption is suggested by the summation (3.30) over the discrete values 
of A, and the factor 1/7 is introduced in order to express u in the form of a 
Fourier integral. At t = 0, we have 


u(x,0) = > e [A(A)cos Ax + B(A)sin Az]dA 
T Jo 
= f(x), weER. (6.40) 


By Theorem 6.10 and Equations (6.22) to (6.24), we see that Equation (6.40) 
uniquely determines A and B as the cosine and sine transforms, respectively, 
of f : 


A(A) = ie f(y)cos Ay dy 
B(A) = L f(y)sin Ay dy. 


Substituting back into (6.39), and using the even and odd properties of A and 
B, we arrive at the solution of the heat equation (6.37) which satisfies the initial 
condition (6.38) 


ly Off 2 
u(x,t) = oF / / f(y)[cos Ay cos Ax + sin Ay sin Aaje~** tdydA 


= a e f(y)cos(x — y)à et> tdydA. (6.41) 


The solution (6.41) can also be obtained by first taking the Fourier trans- 
form of both sides of the heat equation, as functions of x, and using Corollary 
6.16 to obtain 

ût = k(i€)?A(E, t) = -kE AlE, t). 


The solution of this equation is 


a(E,t) = cett, 
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where, by the initial condition, c = a(€,0) = f(€). Thus, as a function in €, 


a(E,t) = fle" 


is the Fourier transform of u(x,t) for every t > 0, and u can therefore be 


represented, according to Theorem 6.10, by the integral 


uls, t) = > [ © fete ag 


1 co co . Z 2 7 
= = | J fiT e didy. 


Here the fact that KO) e—*€"t is integrable on —co < € < oo allows us to 


replace the Cauchy principal value in (6.16) by the corresponding improper 
integral, and the change of the order of integration in the last step is justified 


by the assumption that |f| is integrable. Now, because e746" 


of €, we have 


ule) == fsa) f coste— me e dedy, 


which coincides with (6.41). 
Using the integration formula (see Exercise 6.23) 


X 1 
[ e © cos zê dé = N for allze R, b>0, 
0 


we therefore have 


1 oe. 2 
t) = —(y-2) /4ktq 
UG) ae I. Pye Y 


= =f f (x + 2vktp) e`? dp. 


is an even function 


(6.42) 


Integrating by parts, it is straightforward to verify that this last expression for 
u(x,t) satisfies the heat equation. It also satisfies the initial condition u(x, 0) = 


f(x) by the result of Exercise 4.15. 
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Example 6.19 


The corresponding boundary-value problem for a semi-infinite bar, which is 
insulated at one end, is defined by the system of equations 


ut = kuzzr, VO<a<ow, t>0 (6.43) 
u,(0,t)=0, t>0 (6.44) 
u(z,0)= f(x), O<4%<am. (6.45) 


The solution of Equation (6.43) by separation of variables leads to the set 
of solutions obtained in Example 6.18, 


ulz, t) = [A(A)cos Ax + B(A)sin Arlet t, 0<A<ov. 


For each solution in this set to satisfy the boundary condition at x = 0, we 
must have 
A); = )B(A)e™"* =0 for all t > 0. 
x=0 
If Aà = 0 we obtain the constant solution up = A(0), and if B(A) = 0 the 
solution is given by 
ux (x,t) = A(A)cos Az e~*”*, (6.46) 


That means (6.46), where 0 < A < o, gives all the solutions of the heat 
equation which satisfy the boundary condition (6.44). 
In order to satisfy the initial condition (6.45), we form the integral 


1% 1 se ; 
u(z,t) = a, uy(a,t)dr = a A(A)cos rA e7 ** td. 


When t = 0, oe 
io wS= | AONAN ar, (6.47) 
T Jo 


By extending f as an even function into (—oo, 00), we see from the represen- 
tation (6.47) that A(A) is the cosine transform of f. Hence, with f € L! (R), 


A(A) = J f(y)cos Ay dy = 2 | f(y)cos Ay dy, 
—oo 0 
and the solution of the boundary-value problem is given by 
2 [> tas —k)’t 
u(a,t) = — f(y)cos Ay cos Ax e dydà. (6.48) 
T Jo Jo 


Using the identity 2 cos Ay cos Ax = cos A(y — x) + cos A(y + x) and the formula 
(6.42), this double integral may be reduced to the single integral representation 


1 as 2 2 
Fly) fee /Akt | o—(y+x)? /4kt dy. 
2V rkt 1 


u(x,t) = 
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To obtain an explicit expression for the solution when 


, O<y<a 


fw) ={ Be Sods 


we can use the definition of the error function given in Exercise 5.6, 


erf (a E- ae -2° dp, 


to write 


u(x,t) = -(y—2)?/4kt | (yt)? G dy 


wef | 


1 (a—2x)/2V kt Pa (ata) /2V7 kt 3 
= — e~ +f e” dp 
VT J-a/2vVkt a /2V kt 


1 (a—2x)/2V kt p 1 (ata) /2V kt 3 
== | e? a+ — f e ? dp 


E] 


From the properties of the error function (Exercise 5.6) it is straightforward to 
verify that, for all t > 0, 


u(z,t) -0 asx, 


u(x,t) — erf (a/2vie) as «— 0, 
and that, for all x > 0, 
u(z,t) -0 as too. 


Ast—0, 
(x,t) 1 when 0< a <a, 
ý 0 when x >a. 


Thus, at any point on the bar, the temperature approaches 0 as t — oo; and at 
any instant, the temperature approaches 0 as x — oo. This is to be expected, 
because the initial heat in the bar eventually seeps out to oo. It is also worth 
noting that u(a,t) > 1/2 = [f(at)+ f(a~)]/2 as t — 0, as would be expected 
(see Figure 6.5). 
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Figure 6.5 Temperature distribution on a semi-infinite rod. 


6.3.2 Non-Homogeneous Equations 


The nonhomogeneous differential equation y — y” = f can be solved directly 
by well-known methods when f is a polynomial or an exponential function, for 
example, but such methods do not work for more general classes of functions. 
If f has a Fourier transform, we can use Corollary 6.16 to write 


9+ OG f. 


Thus the solution of the differential equation is given, formally, by 


A 1 
-1 
ylz) =F (foz) ©. 6.49 
(c) O24) (6.49) 
We know the inverse transforms of both f and (€? + 1)~!, but we have no 
obvious method for inverting the product of these two transforms, or even 
deciding whether such a product is invertible. Now we show how we can express 
F-*(f-g) in terms of f and g under relatively mild restrictions on the functions 
f and g. 
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Definition 6.20 


Let I be a real interval. A function f : I — C is said to be locally integrable on 
I if |f| is integrable on any finite subinterval of T. 


Thus all piecewise continuous functions on J and all functions in £! (T) are 
locally integrable. 


Definition 6.21 


If the functions f,g : R — C are locally integrable, their convolution is the 
function defined by the integral 


fega)= [fla —da(tae 
for all x € R where the integral converges. 


By setting x—t = s in the above integral we obtain the commutative relation 
fega)= f flsjgle—s)ds = g+ fle) 


The convolution f * g exists as a function on R under various conditions on f 
and g. Here are some examples. 

1. If either function is absolutely integrable and the other is bounded. Suppose 
f € L! (R) and |g| < M. Then 


Fels f E- DIOa 
cu fis (t)| dt < o0. 


2. If both f and g vanish on (—oo, 0), in which case 


f* g(x j= f He-not t)dt = [| re- ane 


3. If both f and g belong to £7(R). This follows directly from the CBS inequal- 
ity. 
4. If either f or g is bounded and vanishes outside a finite interval, then f * g 
is clearly bounded. 

But we need more than the boundedness of f * g in order to proceed. 
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Theorem 6.22 


If both f and g belong to £1(R) and either function is bounded, then f xg also 
lies in £1(R) and 
Ff * 9) = 9. (6.50) 


Proof 


For f x g to have a Fourier transform we first have to prove that |f * g| is 
integrable on R. Suppose, without loss of generality, that |g| is bounded on R 
by the positive constant M. Because 


f(x —tgt)| < M|f(@— t)| 


and M |f(x — t)| is integrable (as a function of x) on (—oo, co) for all t € R, it 
follows from Lemma 6.9 that the integral 


fega)= [fle dg(tat 
is uniformly convergent. Consequently, for any a > 0, we can write 


f trolde = 


dx 


fe- dga 


1 Pal f(a —t)g(t)| dtdex 
= ff ite eat ae 
= f Wl f ire a) dear 


< f WO f” Fe- 0) vat 


—0O —00O 


= f wola f” Vold: 


This inequality holds for any a > 0, therefore |f * g| is integrable on (0,00). 
Similarly, |f * g| is integrable on (—o0, 0) and hence on R. 
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To prove the equality (6.50) we write 


F(f*g)(E = S f(x — t)g(t)e 5 dtdz 
7 T F Mene e e idit 
fal f(s yeas f g(t)e 8" dt 


a(S), 


where the change in the order of integration is justified by the uniform conver- 


gence of the convolution integral. 


Using Equation (6.50) and the result of Example 6.3 we therefore conclude 
that a particular solution of y — y” = f is given by 


+1 
=fx (3e) (x) (6.51) 
i po ; 
=5 a x —tjelldt 
= sf. fel tdt. (6.52) 


We leave it as an exercise to verify, by direct differentiation, that this integral 
expression satisfies the equation y — y” = f, provided f is continuous. 

In this connection, it is worth noting that the kernel function e~!*~4l in 
the integral representation of y in Equation (6.52) is no other than Green’s 
function G(x, t) for the SL operator 

d2 


L= i 
ate (6.53) 


on the interval (—oo, 00), subject to the boundary conditions limz_.4.. G(x, t) = 
0; for the nonhomogeneous equation Ly = f is solved, formally, by 


ylz) = Lf = ia Ga, t) f (t)dt. 


This solution tends to 0 as  — +ov, as it should, being continuous and in- 


tegrable on (—oo,0o). Under such boundary conditions, the solution of the 


corresponding homogeneous equation y — y” = 0, namely c1e” + cze 
only be the trivial solution. 


, can 
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To solve the same equation y — y” = f on the semi-infinite interval [0, 00), 
we need to impose a boundary condition at x = 0, say 


y(0) = yo. 


In this case the same procedure above leads to the particular solution 


1 lo) 
woo) = 5 [feat 
2 Jo 
but here the homogeneous solution 
yn(@) = ce 7, x >0 


is admissible for any constant c. Applying the boundary condition at x = 0 to 
the sum y = yp + yn yields 


1 (oe) 
m=z | FO e, 
2 Jo 
from which c can be determined. The desired solution is therefore 
TSAR 1 >” 
ya) =f FO (v -3f soetat) ee. 
2 Jo 2 Jo 


If yo = 0, that is, if the boundary condition at x = 0 is homogeneous, then 


ue) =g FO (eet — eet) at, 


and, once again, we conclude that Green’s function for the same operator (6.53) 
on the interval [0, o0) under the homogeneous boundary condition y(0) = 0 is 
now 


G(z,t) = steel — e t, 


In Chapter 7 we show that the Laplace transformation provides a more effec- 
tive tool for solving nonhomogeneous differential equations in [0, 00), especially 
where the coefficients are constants. 


EXERCISES 


6.20 Prove the following formulas. 


Fif(w — all) =e f(E) 
Fle F(E) = FE -— a). 
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6.21 The Hermite function of order n, for any n € No, is defined by 
Palt) = e77? Hn(z), 00 < z < o0, 


where Hn is the Hermite polynomial of order n. Prove that Wn exists 
and satisfies 
ValS) = (~i) V 2m, (8). 
Hint: Use Example 6.17 and induction on n 
6.22 Solve the integral equation 
f u(x)eos gre = { r a 
0 


0, T<E< OO. 


6.23 Prove the integration formula (6.42). Hint: Denote the integral by 
I(z), show that it satisfies the differential equation 261'(z)+zI(z) = 
0, and then solve this equation. 


6.24 Assuming that 
To, xr|<a 
f(z) = { 9 |z| 


0, |z| >a 


in Example 6.18, prove that 


woo Blo (Si) (858) 


Compare this to the solution in Example 6.19, and explain the sim- 
ilarity between the two. 


6.25 Show that the solution of the boundary-value problem 


ut = kurz, O< 4<ow, t>0 
u(0,t) = 0, t>0 
u(z,0) = f(r), O<4< œ, 


where f is a piecewise smooth function in £1(0, 00), is given by 


1 ne 2 2 
u(x,t) = == le /4kt cag PEE) /4kt| qy, 
e) = f to y 
This is a mathematical model for heat flow in a semi-infinite rod 
with one end held at zero temperature. 


6.26 If f(x) = 1 on (0, a) and 0 otherwise in Exercise 6.25, express u(x, t) 
in terms of the error function. 
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6.27 


6.28 


6.29 


6. The Fourier Transformation 


Use the Fourier transform to solve the initial-value problem for the 
wave equation 


Un = Yzer, —OO<a<co, t>0 
u(x, 0) = f(x), —œ < x< œ 
u(x,0) = 0, —00 < T < 00, 


where f is assumed to be a piecewise smooth function in £1(R). 
Show that the solution can be expressed as 


u(a,t) = = J flecos cté el dé, 


and derive the d’Alembert representation 


u(x,t) = AG + ct) + f(a — ct)). 


Solve the wave equation on the semi-infinite domain 0 < 2 < c 
under the initial conditions in Exercise 6.27 and the boundary con- 
dition u(0,t) = 0 for all t > 0. 


Verify, by direct differentiation, that the integral expression (6.52) 
solves the equation y — y” = f. 


7 


The Laplace Transformation 


If we replace the imaginary variable i£ in the Fourier transform 
f= [sae *as 


by the complex variable s = ø + i£, and set f(x) = 0 for all x < 0, the function 
defined by the resulting integral, 


F(s) = i f(a) dz, (7.1) 


is called the Laplace transform of f. When Res = ø is positive, the improper 
integral in Equation (7.1) exists even if |f| is not integrable on (0,00), such as 
when f is a polynomial, and herein lies the advantage of the Laplace transform. 
Because of the exponential decay of e~°”, for o > 0, the Laplace transformation 
is defined on a much larger class of functions than £1(0, 0c). 


7.1 The Laplace Transform 


Definition 7.1 


We use E to denote the class of functions f : [0,co) — C such that f is locally 
integrable in [0,co) and f(x)e~°” remains bounded as x — oo for some real 
number a. 
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1 ọ9— Hix) 


Figure 7.1 The Heaviside function. 


Thus a function f € E is characterized by two properties: first, that f 
is allowed to have singular points in [0,00) where f is nevertheless locally 
integrable; and, second, that f has at most exponential growth as x — oo, that 
is, that there is a real number a such that f(x) is dominated by a constant 
multiple of the exponential function e~°” for large values of x. More precisely, 
there are positive constants b and c such that 


|f(x)| < ce” forall z > b. (7.2) 


For the sake of convenience, € will also include any function defined on R 
which vanishes on (—oo,0) and satisfies Definition 7.1 on [0,00). If we now 
define the Heaviside function H on R\{0} by (see Figure 7.1) 


1, x>0 
He = { 0, «<0 


then the following functions all lie in £E. 

(i) H(x)g(x) for any bounded locally integrable function g on R, such as cos x or 
sin x. Here a, as referred to in Definition 7.1, can be any nonnegative number. 
(ii) H(x)p(a) for any polynomial p. Here a is any positive number. 

(iti) H(x)e**, a > k. 

(iv) H(x)log x, a any positive number. 

(v) H(x)a", > —1, a any positive number. 

But such functions as H(x)e® or H(x)2°" do not belong to £. In the above 
examples of functions in E£, we take f(0) to be f(0*), and it is of no consequence 
how f is defined at x = 0. That is why we chose not to define H at that point. 
The rate of growth of a function f is characterized by the smallest value of a 
which makes f(x)e~°*” bounded as z — oo. When such a smallest value exists, 
it is referred to as the exponential order of f, such as a = 0 in (i) anda =k 
in (iii). In (ii), (iv), and (v) a can be any positive number. A function which 
satisfies the inequality (7.2) is said to be of exponential order. 
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Definition 7.2 


For any f € E, the Laplace transform of f is defined for all Res = o > a by 
the improper integral 


eo = f ~ Hæ tda. (7.3) 


Basically, this definition is tailored to guarantee that f(x)e7®? lies in 
£1(0,00). By the estimate (7.2), 


O 
0 


we see that the integral (7.3) converges for all o > a. The convergence is 
uniform in the half-plane o > a + for any € > 0, where the integral can be 
differentiated with respect to s by passing under the integral sign. The resulting 
integral 


dx < ef e Vd < forall o >a, 
0 


-f Hue i 


0 
also converges in o > a + £. Consequently the Laplace transform L(f) is an 
analytic function of the complex variable s in the half-plane Res > a. As in 
the case of the Fourier transformation, the Laplace transformation £ is linear 
on the class of functions €, so that 


Llaf + bg) = al(f) + bL(Qg), a,b E€ C, f,gEE. 


Example 7.3 


The constant function f(x) = 1 on [0, 00) clearly belongs to €, with exponential 
order a = 0, and its Laplace transform is 


oe 1 
L(1)(s) = f e 7dr = 3 sEC, Res > 0. 


Similarly, the function f : [0,00) — R defined by f(x) = a”, n € N, also 
belongs to €, with a > 0. For all Res > 0 we have, using integration by parts, 


Lla")(s) = a © phen dy 


co 
n =, ee: 
=— x” le tdr 
5 Jo 
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! fo. e) 
= s e dr 
S Jo 
n! 
n! 
> L(a2")(s) = nti’ n € No. 


The integrability of | f(a)e~**| does not depend on Im s = €, therefore it is 
more convenient, for the sake of evaluating the integral (7.3), to set Im s = 0. 
Being analytic on Res > a in the complex s-plane, the function F = L(f) is 
uniquely determined by its values on the real axis s > a. 


Example 7.4 


For any > —1, the Laplace transform 


cat) = | whe tdr 
0 


exists because the singularity of x“ as x — 07 is integrable. To evaluate this 
indefinite integral, we take s to be real and set sx = t. If s is positive, 


T ENS dt 
N a 

0 S s 
1 oO 

-=f e™tt"dt 
sl 0 


1 
= seri! (u + 1), s>0. 


L(x") 


This result can now be extended analytically into Res > 0, and it clearly 
generalizes that of Example 7.3. 


Example 7.5 


For any real number a, 


L(e%)(s) = i Petia. ghy 
0 


Hence, 
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On the other hand, 


s>0. 


; 99 : 1 
L(e%”)(s) — f e~ (siat dr = 


6 s—ia’ 


Therefore 


£(sinax)(s) = = ( ) 


s—ia s+ia 
a 
To recover a function f from its Laplace transform L(f) = F, we resort 
to the Fourier inversion formula (6.16). Suppose f € E so that |f(x)e~°*| is 
integrable on [0,0o) for some real number a. Choose 8 > a and define the 


function f 
_ FE, xz>0 
gla) = { 0, x <0, 


which clearly belongs to £L! (R). Its Fourier transform is given by 
HO) = [fae Pear = LNP + if) = F(E + i8); 
0 


If we further assume that f is piecewise smooth on [0, 00), then g will also be 
piecewise smooth on R, and we can define g to be the average of its left and 
right limits at each point of discontinuity. By Theorem 6.9, 


With 2 fixed, define the complex variable s = 8 + ié, so that ds = id and the 
integral over the interval (—L, L) becomes a contour integral in the complex 
s-plane over the line segment from 6—iL to 6+iL. Thus 


B-iL 
and 
1 B+iL 
f(x) = lim = | F(s)e"*ds, (7.4) 
Loo 2771 B-iL 


which is the desired inversion formula for the Laplace transformation £ : f — F. 
Symbolically, we can write (7.4) as 


f(x) =L"(F)\(a), 
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where 


is the inverse Laplace transform of F. 

One should keep in mind that (7.4) is a pointwise equality. It does not 
apply, for example, to functions which have singularities in [0, 00), for these are 
excluded by the assumption that f is piecewise smooth. Furthermore, because 
f(07) =0, we always have 


B+iL 
lim = | F(s)ds = =f(0*), (7.5) 


The contour integral in (7.4) is not always easy to evaluate by direct integration. 
On the contrary, this formula is used to evaluate the integral when the function 
f is known. The same observation applies to (7.5). For example, with reference 
to Example 7.3, if € > 0, 


1 etiL 1 0, z<0 
jim af -e”°ds = | 1/2, z=0 
RT 1, x>0. 


The inversion formula clearly implies that the Laplace transformation £ is 
injective on piecewise smooth functions in E. Thus, if F = £(f) and G = L(g), 
where f and g are piecewise smooth functions in €, then 


F=G => f=g. 


EXERCISES 


7.1 Determine the Laplace transform F(s) of each of the following func- 
tions defined on (0, œo). 


(a) f(w) = (ax + b)? 
(b) f(x) = cosh x 
(c) f(x) = sin? z 
(d) f(x) = sin z cos x 
C, araa 
(). Fa) 0, ax<r<w 


a C<a<b 


0, x>b 
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7.2 Determine the inverse Laplace transform f(x) for each of the follow- 
ing transform functions. 


(a) F(s)= 5 

b) F(s)= 3? 
1 

(c) F(s) = 3(s41) 

@ Fs) =a 

© F(s) = ED 


(£) F(s) =1/s°/? 
14s? + 55s +51 
F(s) = , 
EPES ap B rar 
7.3 If a > 0, prove that 


£(F(ax))(s) = F (s/a), 
where F(s) = L(f(x))(s). 


7.2 Properties and Applications 


As with the Fourier transform, when a function and its derivative have Laplace 
transforms, the transform functions are related by a formula which is easily 
derived by using integration by parts. We should recall at the outset that a 
piecewise continuous function is always locally integrable. Hence all piecewise 
continuous functions of exponential growth lie in E. 


Theorem 7.6 


(i) If f is a continuous and piecewise smooth function on [0, co) such that both 
e°* f(x) and e~°* f'(x) are bounded, then 


L(f')(s) = s£(f)(s) — FOH), Res >a. (7.6) 
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(ii) If f is a piecewise continuous function on [0,0o) and e~°* f(x) is bounded, 
then i 
L (Jo f(t)dt) (s) = SEM); r>0, Res>a. (7.7) 


Proof 


(i) The assumptions on f guarantee the existence of both £(f)(s) and L(f’)(s) 
on Res > a. Integrating by parts, 


cles) = | Pede 


= e~e f(a) + sf f(aje 7de 
= sL(f)(s)— f(0*), Res >a. 


Note that the continuity of f is needed for the second equality to be valid (see 
Exercise 6.9). 
(ii) Let 


ate) = f fod. 


Because the function |e~°* f(a)| is bounded on [0,00) by some positive con- 
stant, say M, we have 


hence 


ax 


This implies that e~°”g() is bounded on [0,00). Now g is continuous and its 
derivative g’ = f is piecewise continuous on [0,00), so we can use part (i) to 
conclude that 

L(f)(s) = L(g')(s) = s£L(g) — g(0"). 
With g(0*) = 0 we arrive at the desired result. The case where a = 0 is 
straightforward. 


By induction on n, we can generalize (7.6) to arrive at the corresponding 
formula for the nth derivatives of f, and we leave the details of the proof to 
the reader. 
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Corollary 7.7 


Let f, f’,...,f@~» be continuous and piecewise smooth functions defined on 
(0,00). If f, f’,...,f™ are of exponential growth, then f( has a Laplace 
transform on Res > a, for some real number a, given by 


L(f™)(s) = s"£(f)(s) — 8”! (0+) _ 3s" f'(0+) Sta fD (0+). (7.8) 


Example 7.8 
(i) Using the result of Example 7.5 and Theorem 7.6, 


L(cosax)(s) = £ (+ £ sin ax) (s) 


1 a 
wera 8 


(ii) Given 


a s(s? — 1)’ 


we can apply the formula (7.7) to obtain 


1 1 í 
fear (+3 — :) (x) = f sinh tdt = cosh x — 1. 

According to Theorem 7.6, differentiation of f is transformed under £ to 
multiplication by s, followed by subtraction of f(0*), and integration over (0, x) 
is transformed to division by s. Conversely, if f € E and L(f) = F, then we 
can show that 


L(axf)(s) = = 7 L(f)(s) = —F'(s) (7.9) 
L(f /x)(s ay a F(z (7.10) 
where it is assumed in (7.10) that the function f(x)/z is locally integrable, and 


the contour of integration in the z-plane from s to oo is such that Im z remains 
bounded. We leave it to the reader to prove these two formulas. 


230 7. The Laplace Transformation 


7.2.1 Applications to Ordinary Differential Equations 


Theorem 7.6 and its corollary make the Laplace transform an effective tool for 
solving initial-value problems for linear ordinary differential equations, espe- 
cially when the coefficients in the equation are constant. It reduces the prob- 
lem to first solving an algebraic equation and then taking the Laplace inverse 
transform of the solution. Consider, for example, the second-order differential 


equation 
y” +ay' +by= f(z), «x>0, (7.11) 
where a and b are constants, under the initial conditions 
yO"°)=y, yO )=n. (7.12) 


Using Corollary 7.7, with L(y) = Y and L(f) = F, we obtain 
s*Y — syo — yı + a(sY — yo) + bY =F. 
Hence 
F(s) yo(s +a) + yı 
s? +as +b s? +as+b 


Y (s) = 


and the solution is given by 


ylz) = L(Y) (2) 
-ı |__F(5) _ p-1 | yols +a) + yı 
=e Jae] +e ae Jo 
= Yp(Z) + yn(Z). 


Yp is, of course, a particular solution of the nonhomogeneous equation (7.11), 
and yn is the corresponding homogeneous solution. This method provides a 
convenient way for obtaining y,, especially when the function f is not continu- 
ous. The homogeneous part of the solution is simply the inverse transform of a 
rational function in s, which may be evaluated by using partial fractions, and 
which vanishes when the initial conditions on y and y’ are homogeneous, that 
is, when yo = yı = 0. Thus the nonhomogeneous terms in either the differential 
equation or the boundary condition merely add more terms to the transform 


function Y, but otherwise pose no extra complication. 

Equations (7.11) and (7.12) in fact represent a mathematical model for 
current flow in an RLC electric circuit. As shown in Figure 7.2, this consists of 
a resistance, measured by R, an inductance L, and a capacitance C’, connected 
in series and supplied with an input voltage v(t), which depends on time t. If 
the current in the circuit is denoted y(t), then the drop in potential across each 
of the three circuit elements is given, respectively, by 


wm= Ful), el) =v), ve = f alr ar 
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v(t) L 


€ 


Figure 7.2 RLC series circuit. 


According to Kirchoff’s law, the sum of these potential drops is equal to the 
input voltage, 
1 f 
Ry + Ly’ + al y(r)dr = v(t). 
C to 
By differentiating with respect to t and dividing by L, we end up with the 
second-order equation 
R 1 
pr / 
aie —_y= f(t 
yt sy + ray = FO, 
where the coefficients are constant and f = v'/L is given. If y and y’ are 
specified at t = 0, when the circuit switch is closed, then we have a system of 


equations similar to Equations (7.11) and (7.12). The current is determined by 
solving the system along the lines explained above. 


Example 7.9 
Solve the initial-value problem 


y” +4y' + 6y = e~, t>0, 
y(0)=0,  y'(0)= 


Solution 
The Laplace transform of the differential equation is 


1 


s?Y +4sY + 6Y = 
s+1 


? 


and the transform function is 
1 1 


YO = GENEE) erer 
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To invert this rational function we use partial fractions to write 


1 1 1 s+2 1 1 


Y (s) = l 
e 3551 eea e A 


By the linearity of the inverse transformation, we therefore have 


1 1 s+2 
t) = —t —1 
WO) = get E 


Jo so | | (0. (7.13) 


To evaluate the last two expressions, we need to determine the effect of 
translation in s on the inverse Laplace transform £~!. This is provided by the 
following translation theorem. 


Theorem 7.10 
If f € E and L(f)(s) = F(s) on Res > a, then 


L(e* f)(s) = F(s—a), 8$—a>a, (7.14) 


L|H(« — a)f(a—a)\(s) =e “*F(s), a>0, s>a. (7.15) 


Proof 
Equation (7.14) is a direct result of the definition of the Laplace transform, 
L(e% f)(s y= f° f(x (s— a)z dx 
= F(s- a), s-a>a. 


To prove (7.15), note that 


e “F(s) = T f(aje 82+ dr 


7.2 Properties and Applications 


Going back to Equation (7.13) in Example 7.9, since 


fo € =) (t) = cos V2 


ai 1 olaf y2 \ 1, 
L£ (a0 (25) = aeva 


we can apply the formula (7.14) to obtain 


ae Sy (t) = e7” cos V2t 


1 1 
Sy S t)= — —2t a: Ot. 
Gaara] O= yrs 
The solution (7.13) is therefore given by 
1 1 1 
t) = se" — | = cos V2t in V2t ) e7”. 
y(t) ze (08 v2 + ign v) e 


Remark 7.11 
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One may wonder why the second-order equation in the above example has to 


have two initial conditions at x = 0 to determine its solution, whereas a similar 
equation, y — y” = f with 0 < x < œ, which was discussed at the end of 
Chapter 6, required only one condition, namely y(0) = yo. The reason is that 
this last equation was solved using the Fourier transformation under the tacit 
assumption that y is an £! function which, being continuous, tends to 0 as 


x — oo. This constitutes a second boundary condition. 


In Example 7.10 we could just as well have used the conventional method 


for solving differential equations with constant coefficients. But in the following 


example, where the nonhomogeneous term is not continuous, there is a clear 


advantage to using the Laplace transform. 


Example 7.12 
Solve the initial-value problem 
y'+3y= f(t), t>0, 
y(0) = 1, 


where 
0, t<0 
f®H®=¢ t, 0<t<l 
0, tl. 
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Figure 7.3 


Here y(t) represents the current in an RL circuit where the input voltage f(t) = 
t is switched on at t = 0, and then switched off at t = 1. 


Solution 
By expressing f in the form f(t) = t|H(t) — H(t — 1)], 
we can use the formula (7.9) and Theorem 7.10 to write 


L(f)(s) = -Leno — H(t — 1))](s) 


(s+3)Y(s)-1= 


s2 s2 S 
ee E Jii 1l 1 
gts” sra gsr C sF 


With 1 
aoe Ae es os —3t H t 
ae (=) : ©, 
we use (7.7) to write 
$ 
it G : ) (t) zA ode = lä — e™%)H(t) 
E3 ; 3 


c! (=) (t) = [ F0-e%ar= . (« | Ss 5) HO. 


Now (7.15) implies 


mle) 
eed b)o- 


[pe ""- Ae — 1) 


Wl wile 


[e =j sey = ‘| H(t—1), 
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hence 


3J g! 
10 + 2e3 
N, tS: 


Although f is discontinuous at « = 1, the solution y is continuous on (0,00). 
This is to be expected, because the inductance L does not allow sudden changes 
in the current in an RE circuit. 

The method used to arrive at the solutions in Examples 7.8, 7.9, and 7.12 
forms part of a collection of techniques based on the properties of the Laplace 
transformation, known as the operational calculus, which was developed by 
the English physicist Oliver Heaviside (1850-1925) to solve linear differential 
equations of electric circuits. This method is not limited to equations with 
constant coefficients, as the next example shows. 


Example 7.13 
Under the Laplace transformation, Laguerre’s equation 
xy” +(1—2)y’ + ny = 0, x 10; n € No, 


becomes 


Ziy sy(0) — y'(0)] + sY — y(0) + Lisy —y(0)]+nY =0 


(s—8°)Y'+(n+1-s)Y =0 
Y’ n+1l-s n n+1 
Y s(s—1) s-—-l1 s 
Integrating the last equation, we obtain 
(s- 1)” 
soti o”? 


Y(s)=c 


where c is the integration constant. Taking the inverse transform, 


va) = et |S") (e) 


= ce” L7! 


ao 
c , da” 


= Pile ane T”). 


Laguerre’s polynomial L,, is obtained by setting c = 1. 
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7.2.2 The Telegraph Equation 


The Laplace transform can also be used to solve boundary-value problems for 
partial differential equations, especially where the time variable t is involved; 
for then the transform can usually be applied with respect to t over the semi- 
infinite interval [0, 00). Consider the equation 


Uge = Aun + Bu; + Cu, 0<t<a, (7.16) 


where A, B, and C are nonnegative constants. This equation, known as the tele- 
graph equation, describes an electromagnetic signal u(x,t), such as an electric 
current or voltage, traveling along a transmission line. The constants A, B,C 
are determined by the distributed inductance, resistance, and capacitance (per 
unit length) along the line (see [7], vol. 2). If the transmission line extends over 
—o0 < £ < œ, then two initial conditions (at t = 0) on u and uz are sufficient 
to specify u. But if it extends over 0 < x < oo, then we need to specify u at 
x = 0 as well, in which case the boundary conditions take the form 


u(0, ) = f(), t>0, 


Here f(t) is the input signal which is transmitted, and the two initial conditions 
are needed because Equation (7.16) is of second order in t (if A = 0 we do 
not need the condition on u+). We can attempt to solve Equation (7.16) by 
separation of variables, using the Fourier integral as we did previously, but the 
fact that all the boundary conditions are nonhomogeneous makes the procedure 
more difficult. It also places unnecessary restrictions on the behaviour of u and 
fast— oo. 

We can simplify things by solving Equation (7.16) under two sets of bound- 
ary conditions: first with f = 0, and then with g = h = 0. Because (7.16) is 
linear and homogeneous, the sum of these two solutions is also a solution of 
the equation, and it satisfies the three nonhomogeneous boundary conditions. 
In both cases, the Laplace transform can be used to construct the solution. Let 
us take the second case, where the boundary conditions are 


u(0,t)= f(t), t20, 
u(a,0) = uz(x,0) = 0, x>0. 


Assuming that f, Uu, Uc, Ure, Ut, and uy all lie in E as functions of t, and 
that u and its first partial derivatives are piecewise smooth functions of t, we 
can apply the Laplace transformation with respect to t to Equation (7.16) to 
obtain 
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Use = (As? + Bs + CVU, (7.17) 


where 


U(a,s) = n u(x, the “dt (7.18) 


and Res is greater than some real number a. If, for example, the signal u is 
bounded we may take a = 0. Note that differentiation with respect to x may 
be taken inside the integral (7.18) because the latter converges uniformly on 
Res > a +e for any e€ > 0. For every fixed value of the parameter s, Equation 
(7.17) has the general solution 


U(x, s) = e1(s)e*9)® + e9(s)er*O), 


where cı + c2 = U(0,s) = L(f)(s) = F(s) and A(s) = VAs? + Bs + C. If we 
take the principal branch of the square root, where Re A(s) > 0, the solution 
e*(5)” becomes unbounded in the right half of the s-plane and must therefore 
be dropped (by setting cı = 0). The resulting transform function 


U(x,s) = F(s)e)* 


can, in principle, be inverted to yield the desired solution u(x,t) = £~1(U)(z, t), 
although the computations involved may be quite tedious. Nevertheless, there 
are a number of significant special cases of Equation (7.16) where the solution 
can be obtained explicitly. 

(i) If A= 1/2 and B = C =0, (7.16) reduces to the wave equation 


Utt = Uzr- 
Here the transform function 
U(x, s) = e**/°F(s) 
is easily inverted, using formula (7.15), to 
u(x,t) = H(t — a/c) f(t—2/c). 


This expression indicates that there is no signal at a point x until t = w/c. Thus 
the signal f(t) moves down the line without distortion or damping and with 
velocity c. This is to be expected, because the damping factor Bu; is absent 
and, consequently, no energy is dissipated in the transmission. 

(ii) If As? + Bs +C = (s+ b)?/c? is a perfect square, then 


U(x, s) = e~ (+4)#/¢ F(s) 
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and the solution is 


u(x,t) = e? H(t — x/c) f(t- z/o). 


Here the signal f(t) again moves down the line with velocity c, undistorted, but 
attenuated by the damping factor e7?®/° 
distortionless if it is designed so that the distributed resistance, inductance, 
and capacitance are such that the relation B? = 4AC holds. The interested 
reader may refer to [10] on this point and on some other interesting historical 
background to the telegraph equation. 

(iii) If A = C = 0 and B = 1/k, where k is a positive constant, we obtain the 
heat equation 


. A transmission line therefore becomes 


Ut = kuga. 


In this case the Laplace transform of u is 
U(x, s) = e7®V S/E F(s), (7.19) 


which is not readily invertible. To determine £~'(U) we need another rule 
which allows us to invert the product of two transforms. By analogy with the 
Fourier transformation we should expect this to involve the convolution of the 
inverse transforms. 

If the functions f and g are locally integrable on [0, 00), then their convo- 
lution 


OE | “fle be@ar= J FOE- OS G2F@) 


is well defined and locally integrable on [0,0o). For if [a,b] is any finite interval 
in [0,00), then 


b b x 
[irxa@lars f f ife- oldas 
b b 
= | | He-Dæ- DII dtar 


b b 
a / He- o |Ne— dae la(t)| at 


b [ pb—t 
< | | HIFO) a I(E) dt. 
0 a—t 
Because f is locally integrable and 0 < t < b, the integral of H|f| over [a—t, b—t] 
is uniformly bounded by a constant, and, because g is also locally integrable, 
| f * g| is integrable on [a,b]. The convolution fxg is also (piecewise) continuous 
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if either f or g is (piecewise) continuous, and (piecewise) smooth if either f or 
g is (piecewise) smooth (Exercise 7.16). 

If f(x) and g(x) are dominated as x — oo by e®”, then one can easily check 
that f x g(x) will be dominated by ef” for any 3 > a. Consequently, if f and 
g belong to €, then so does their convolution f * g, and its Laplace transform 


is given by 
L(f * g)(s y= foe =f p09 (x — t)dtdx 


s i H(x —t)f(t)g(a — the dtdx 
= i O) ie aye dydt 
= L(F)L(9)(8). 


In the third equality, the order of integration is reversed, and this is justified by 
the uniform convergence of the double integral on Res > 8 +e for any positive 
e. Thus we have proved the following convolution theorem which corresponds 
to Theorem 6.22 for the Fourier transformation. 


Theorem 7.14 
Let f,g € E. If L(f)(s) = F(s) and L(g)(s) = G(s), then 


L(f * g)(s) = F(s)G(s). 
Now we can go back to Equation (7.19) to conclude that 
u(x,t) = £7} (er#v*/* F(s)) (t) 
SJ Cag (t). 


The function Lo1(e-2vs/k) (t) can be evaluated by using the inversion formula 
(7.4), which requires some manipulations of contour integrals (see Exercise 
7.21), or it may be looked up in a table of Laplace transforms. In either case 


—1 f „,—z4/s/k _ T —x?/4kt 
L£ (e \@= JG ; 


hence 


2 
u(x,t) y Gog eer ae. (7.20) 
~ ark 


Here the solution u differs eee from that in the first two cases. It tends 
to 0 as x — œ at any time t and also as t > œ at any point x, and the signal 
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f is distorted as it spreads along the line with no fixed velocity. This is not 
surprising, as the original equation (7.16) is radically changed by dropping the 
first term. It changes from a hyperbolic equation, with wavelike solutions, to 
a parabolic equation whose solutions travel in a diffusive manner, in much the 
same way that heat or gas spreads out. 


EXERCISES 


7.4 Sketch each of the following functions and determine its Laplace 
transform. 


d) H(x — T /2)cos x 
e) (1—e~*)[H(x) — H (x -— 1)]. 
7.5 Use the Heaviside function to represent the function 


T O0<xr<l1 


fa) ={ el-® a> 


by a single expression. Sketch the function and determine its Laplace 
transform. 


7.6 Determine the inverse Laplace transform of each of the following 
functions. 


s*+2s4+2 


1 
(e) <(e-* + e=) 
1 
s—1 
1l+e 78 
S249 ` 


(e738 + e~*) 


(e) 


7.7 If f is not assumed to be continuous in Theorem 7.6(i), what would 
Equation (7.6) look like? 
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7.8 Solve the following initial-value problems: 
a) y” + 4y’ + 5y =0, (0) =1, y/(0) =1 

b) 9y” — 6y' +y =0, y(0) = 3, y 

c) y” + 2y' + 5y = 3e sinz, y(0)=0, y/(0) =3 

d) y” +2y' — 8y = —256x°, y(0) 


II 
nh 
on 
ee 
fae 
Co 
SS 
II 
w 
for) 


( 
( 
( 
( 
( 
( 


e) y” — 3y' + 24y = H(t—1), y(0) =9, y’ (0)=1 
f) y + 2y = 2[H(x) — H(z — 1)] 
A _ fj sing, O<a<T _ Rae 
OPETE ee Keke , y(0) =0, y/(0) =0. 
7.9 Invert each of the following transforms. 
s 
a 


(d) cot~+(s + 1). 


7.10 The function Si: R — R, defined by the improper integral 


j= [= sint 


is called the sine integral. Prove that 


£ (=) (s) = tan`! C) 


L{Si(z)|(s) = L tan“! (5) , 


7.11 (a) Let f € E be a periodic function on [0, 00) of period p > 0. Show 
that 


P 
L(f)(s) = a. flae~"de, 3 >0. 


(b) Use this to compute £(f), where f(a) = x on (0,1), f(x + 1) = 
f(x) for all x > 0, and f(x) =0 for all  < 0. 


7.12 Prove that £(f)(s) > 0 as Res — oo for any f € E. 
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7.13 


7.14 


7.15 


7.16 


7.17 


7.18 


7.19 
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Use the Laplace transformation to solve the integral equation 
i (x — t)?y(t)dt = f(x), and state the conditions on f which ensure 
that the method works. 


If f(x) = e” and g(x) = 1/yrz prove that 
f * g(x) = e? erf( V2). 
Use this to evaluate £[e” erf(./x)| and Llerf(./z)). 


Determine L([x]), where [x] is the integer part of the nonnegative 
real number 7; that is, 


[cz] =n for alla € [n,n +1), n € No. 


Given two locally integrable functions f, g : [0, o0) — C, show that 


(a) f * g is continuous (smooth) if either f or g is continuous 
(smooth). 


(b) fxg is piecewise continuous (smooth) if either f or g is piecewise 
continuous (smooth). 


Use the Laplace transformation to obtain the solution of the wave 
equation 
Utt = C Uss, x >0, t>0, 


subject to the boundary condition 
u(0, t) = cos? t, t>0, 
and the initial conditions 


u(x,0) = uz(x,0) = 0, O0<a<l. 


Solve the boundary-value problem 
ut — au = kee, x >0, t>0, 
u(0, t) = f(t), t= 0, 
u(z,0) = 0, x>0. 


Solve the boundary-value problem 


Utt + 2up + U = Cruze, x >0, t>0, 
u(0,t) = sint, t>0, 
u(x, 0) = u(x, 0) = 0, x>0. 
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7.20 In Equation (7.20), prove that u(x,t) — f(t) as z > 0. 
7.21 Use contour integration in the s-plane to prove that 
Ga = nel, 
where a > 0. Conclude from this formula that 
Loew) cape ae 
2V Tr’ 
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Solutions to Selected Exercises 


Chapter 1 


1.2 


1.4 


1.7 


1.14 
1.17 
1.21 
1.22 
1.23 


(a) Complex vector space, (b) real vector space, (c) not a vector space, 
(d) real vector space. 


Assume that the numbers are different and show that this leads to a 
contradiction. Assuming {21,...,%,} and {y1,...,Yn41} are bases of the 
same vector space, express each y;, 0 < i < n, as a linear combination 
of £1,...,£n. The resulting system of n linear equations can be solved 
uniquely for each x;, 0 <i < n, as a linear combination of y,,0<i<n 
(why?). Since yn+1 is also a linear combination of x1,...,2%, (and hence 
of y1,---,Yn), this contradicts the linear independence of y1, ..., Yn441- 


Recall that a determinant is zero if, and only if, one of its rows (or 
columns) is a linear combination of the other rows (or columns). 


Use the equality |x + y||? = ||x||? +2 Re (x, y) + |ly||?. Consider x = (1, 1) 
and y = (i, i). 


(a) 0, (b) 2/3, (c) 8/3, (a) VTA. 
(ffi) / fall = VT (Ff, fe) / fall = 0, (F fs) / I fsll = v72 


No, because |x| = x on [0, 1].. 
a= —1, b = 1/6. 
Use the definition of the Riemann integral to show that f is not integrable. 


(i) 1/V2, (ii) not in £?, (iii) 1, (iv) not in £? 


Examine the proof at the beginning of Section 1.3. 
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1.24 Use Exercise 1.23 to show that f and g must be linearly dependent. 
Assuming g = af show that a > 0. 


1.25 a > —1/2. 

1.26 a < —1/2. 

1.28 Use the CBS inequality. f(x) = 1/,/z on (0, 1]. 
1 

1.30 sin? z = 1 sin £ — 1 sin 3x. 


1.32 Use the fact that, for any polynomial p, p(a)e~* — 0 as x > œœ. 


1.34 (a) The limit is the discontinuous function 


1, \z| >1 
lim s” 0, jal <1 
noo l+ 1/2, r= 


undefined, x= —1 


1.35 (a) Pointwise, (b) uniform, (c) pointwise. 


1.37 Pointwise convergent to 


fs eee 


l-r, O<a<l. 


1.40 The domain of definition of fn is not bounded. 
1.42 (a) R, (b) z # —1. 

1.50 (a) 1, (b) 1, (c) 0. 

1.51 (a) Convergent, (b) convergent, (c) divergent. 
1.57 cy = 1, c2 = —2/m, cg = —1/r. 


1.58 ago = T/2, ay = —4/n, a2 0, bi 0, bo 0. 


1.61 ap =1/n. L? convergence. 


Chapter 2 


2.1 (a) y = e?*(c,cosV/3x + cosin/32) + e7 /4. 
(b) y= ar? + c2 + z’. 


1 
(c) y = £7} (c1 + cglogx) + 127 1. 
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2.3 


2.5 
2.7 
2.8 


Cn42 = — zH Cn, y = co (1 — 22? + ĝt +) +c (£= x? + 4+), 
ja] <1. 
A second-order equation has at most two linearly independent solutions. 


(a) y” + 2y' + 5y = 0, (b) z?°y” + xy’ — y =0, (c) zy” +y' =0. 


Use Lemma 2.7 and the fact that a bounded infinite set of real numbers 
has at least one cluster (or limit) point. This property of the real (as well 
as the complex) numbers is known as the Bolzano—Weierstass theorem 
(see [1]). 


2.10 Use Theorem 2.10. 
2.12 The solutions of (a) and (c) are oscillatory. 
2.13 y = 2~\/2(cycosx + csing). The zeros of z~'/?cosx are {3 + nm}, and 
those of 2~1/?sing are {nr}, n € Z. 
2.15 Use Theorem 2.10. 
2.17 (a) =O”, A €C, (b) eV”, Re VÀ > 0. 
2.19 (a) p = 1/z?, (c) p = e7, 
2.21 p= e°”. An = n?n? +1, Un(@) = e7 ”sinàns. 
2.23 Àn = n?T?(b — a)7?, un(x) = sin( “9 ) . 
b-a 
2.27 (a), (b), (c), and (£f). 
2.29 Change the independent variable to € = x +3 and solve. An = (=) + 
0g 
1 
Fe unt) = (w+ 3)-1?sin (5 loge +3)). 
2.31 Refer to Example 2.17. 
2.32 Multiply by @ and integrate over [a,b]. 
Chapter 3 
3.2 No, because its sum is discontinuous at x = 0 (Example 3.4). 
4 oo 1 
3.4 T- |z| = + m Sra Qn +i cos(2n + 1)x. Uniformly convergent by 


3.6 


the Weierstrass M-test. 


Use the M-test. 
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3.9 


3.11 


3.27 


3.29 
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(a) and (c) are piecewise continuous; (b), (d), and (e) are piecewise 
smooth. 


Use the definition of the derivative at x = 0 to show that f’(0) exists, 
then show that a f'(x) does not exist. 
1)”+ 


=) Ji ~ sinne. 
n=1 n 


3 1 
(d) cos? z = {cost t gcos 3a. 


The convergence is uniform where f is continuous, hence in (b), (c), and 
(d). 
In Exercise 3.15 (e), S(£2) = $(e? + e~?), and in (f) $(41) = 0. 


T 3) ae (Qn +1) 


=o ay —— cosnz. Evaluate at z = 0 and z = r. 
n=1 n2 


f’ is an odd function which is periodic in m with f'(x) = cosx on [0,7], 
hence S(x) = }>" | bysinker where by = Æ (HEP) > 1, b = 0. 
S(nT) = 0, S(5 + nr) = f'(§ + nm) =0. 


3 1 
(a) u(x,t) = ge sine — ge sin3e. 


(b) ulz, t) = en hr t/4sin = en 25k 74/36 gj OME 


oo 2 l 
u(x,t) = Ee ansin "x cos Tt an = a x(l x)sin "a dz. 


Assume u(x,t) = v(x, t) + y(x) where v satisfies the homogeneous wave 
equation with homogeneous boundary conditions at x = 0 and x = l. This 
cnn 


oo NT 
leads to p(x) = 54¢(2? — lx), and u(x,t) = J, nsina cos Tt 


9 pl 
with a, = -7f Y(x)sin Ze dx. 

ra i 
Assume u(x, y, t) = v(x, y)w(t) and conclude that w” /w = c? Av/v = A? 
(separation constant). Hence w(t) = A cos At+ B sin At. Assume v(x, y) = 
X(x)Y (y), and use the given boundary conditions to conclude that 
— + -5 T, m,n EN, 

n 
X(x) = sin —a, Y (y) = sin Ty, 
a 
Umn(2, Y, t) = (AmnCosAmnct + BmnsinAmnct)sin a sin Ty. 
a 
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Apply the initial conditions to the solution 


u(x, y,t SS ae 


n=1m=1 


to evaluate the coefficients. This yields 


Ann -5 [is f(a, y)sin M esin Ty dzdy, 
Bag = af a g(a, y)sin a sin ay dady. 


0 3r\ T}. 3m. 3r 
3.35 u(x, y) = sinh — sin -7 sinh au: 


3.37 (b) Use the fact that u must be bounded at r = 0 to eliminate the coef- 
ficients dn. 


(c) u(r, @) = Ao + ae (=) (A, cosné + Bysinné). 


Chapter 4 
4.3 From the recursion formula (4.7) with k = 2j, it follows that 
i 2(j+1) 
lim eau | =2? <1 forall x € (-1,1). 
joo [egy] 


The same conclusion holds if k = 2j + 1. Because Qh (x) = (1 — #?)7', 


lim | p(x)Qo(x) = 


2 


whereas 


lim, p(2)Qo(x) = 
4.5 The first two formulas follow from the fact that P, is an even function 
when n is even, and odd when n is odd. 
(—1)"(2n)! 1)” (2n — 1)---(8)(1) 


a E ) 
Pon (0) = a0 = ~ = ( Cn) (2) 
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4.7 Differentiate Rodrigues’ formula for P,, and replace n by n + 1 to obtain 


1 d” 
2n! dx” K 


PA= 2n + 1)x? — 1) (£? -1)"""], 


then differentiate Pa—ı and subtract. The first integral formula follows 
directly from (4.14) and the equality P (+1) = (+1)”. The second results 
from setting x = 1. 


4.11 (a) 1- x°? = P(x) — 2P,(x) — 2P; (x). 
(b) |x| = 5Po(x) + 3Po(x) — i Pa(z) + 


4.13 f(a) = ae CnPn(x), where cn = (2n + 1/2 f(x)Pa(x)dz. Because 


f is odd, cn = 0 for all even values of n. For n = 2k + 1, 


1 
C2k+1 = (4k T° 3) | Pog (x)dax 
vA 


= (4k + 3) ——~ x E [P2x(0) — P2r+2(0)] 


(2k)! (2k + 2)! 
= Ez T PFE 1)! 


= (y CO! 4k +3) 
22h KIKI (2k + 2)’ 


k € No. 


Hence f(x) = 3 Pi (x) = £P3(z) + H P; (x) +- . Atz=0, 


4.15 ([ ea) =4 fE fe OP dandy =4 f° 7? er drdd =r. 


4.16 Replace t by —t in Equation (4.25) to obtain 


> Lm = en 2at-t? si D T a(o)”, 


n=0 
which implies H,,(—2x) = (—1)” Ha (x). 
4.17 Setting x = 0 in (4.25) yields 
X 1 ge ae PE NEA 
zj HCO)" =e" =) (-1) ae ; 
k=0 n=0 


By equating corresponding coefficients we obtain the desired formulas. 
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4.19 If m= 2n, 7 
en _ (2n)! 5y Hop(x) 
pn 2 (2k)l(n — K)! 
If m= 2n+ 1, 
on+1 _ (2n+1)! Å: Həkr+ı(£) R 
TO = gan Gea “he TEN 


k=0 


4.23 Use Leibnitz’ rule for the derivative of a product, 
(n) _ Sfr (n—k) (k) 
(fa =>) ( ae g" 
k=0 
with f(x) = xz” and g(x) =e”. 


4.25 Tan y CnLn(x), where aal e *a™L,(x)dx = (—1)” 


0 ni(m -= n)! 


x 1 2 
4.28 u(x) =o to f de= +0, (Iog.2 + 2+ sate) : 
x ! 
4.29 The surface y = 7/2, corresponding to the xy-plane. 
4.31 The solution of Laplace’s equation in the spherical coordinates (r, vy) is 
given by Equation (4.42). Using the given boundary condition in Equation 
(4.43), 


2 1 am /2 
Qn = ng | 10P,,(cos y)sin y dy 


o 5(2n+1) ft 
ies ane f Pa(x)dx 


= Pri) Pai),  nEN, 


where the result of Exercise 4.7 is used in the last equality. We therefore 
arrive at the solution 


urg =5 45 5. [Pn—1(0) zP e0] (=) " P,(cos 2] 


3 7 3 
=5 i+ 5 Pi (cos 9) -3 (a) P3(cos g) + | . 

Note that u(R,y) — 5 is an odd function of y, hence the summation 
(starting with n = 1) is over odd values of n. 


4.33 In view of the boundary condition u,(r,7/2) = 0, f may be extended as 
an even function of y from [0,7/2] to [0,7]. By symmetry the solution 
is even about y = 7/2, hence the summation is over even orders of the 
Legendre polynomials. 
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Chapter 5 


5.1 


5.3 


5.5 


5.27 
5.29 


For all n € N, the integral I,(x) = fọ e~'t®~ ‘dt is a continuous function 
of x € [a,b], where 0 < a < b < ow. Because 


CO loc) 
0 <f eas f a de & 0, 
n n 


it follows that I, converges uniformly to I(x). Therefore I(x) is continu- 
ous on [a, b] for any 0 < a < b < ov, and hence on - oo). By a similar pro- 
cedure we can also show that its derivatives I’(x) = Jy e~t”! log tdt, 
I(x) = f° et- (log t)? dt, --- are all eta on (0,00). 

1 (2n)! 
(2) 


r (n+4)= (n—$)--- (4) 0 G) = amA (3)- From Exercise 5.2 we 


know that I (5) = y7. 


Use the integral definition of the gamma function to obtain 
pepe (=+5 5) = 5 o ee (8+6 (2)??-1(a + B)dadg, 


then change the variables of integration to £ = a? + 8°, n = 2a to arrive 
at the desired formula. 


Apply the ratio test. 
Differentiate Equation (5.12) and multiply by z. 
Substitute v = —1/2 into the identity and use Exercise 5.8. 


Substitute directly into Bessel’s equation. Note that, whereas J;,(x) is 
bounded at x = 0, yn (x) is not. Hence the two functions cannot be linearly 
dependent. 


The definition of I„, as given in Exercise 5.22, extends to negative values 
of v. Equation (5.18) is invariant under a change of sign of v, hence it is 
satisfied by both J, and I_, 


Follows from the bounds on the sine and cosine functions. 
Applying Parseval’s relation to Equations (5.22) and (5.23), we obtain 


i. cos?(x sin 0)d0 = 2r Jê (x) + 4r 5 dente) 


=T m=1 


/ sin?(a sin 0)d0 = 4r 5 J2n-1(2). 
EIN m=1 


By adding these two equations we arrive at the desired identity. 


Solutions to Selected Exercises 253 


5.31 Apply Lemma 3.7 to Equations (5.24) and (5.25). 


b b? 
5.33 (a) (1, Jo(ina))a= fy Jo(unr)ade = n wD Joona) e= = Ft (Hrb). 
k 


Therefore 3 
2 1 
l=- —— J (1,2). 
b kel MJ (Hy) 
5 b’ 4b 
(c) (a?, Jo(u,2)),, = | — — ~ } Ji(s_b). Hence 
Hk Hk 


2 -4 
Jhe Lk Jolups). 
5 n OM) 


b/2 b 
(e) (f, Joliu2))» = A Jo(mz)zdz = il Hence 


olu 
b > ne cae K2). 


5.35 From Exercises 5.13 and 5.14(a) we have (x, Ji(u;,2)),, =f J (piri a 


dx = —Jo(ug)/Ug = J2(Hp)/ Hg, and, from Equation (5.34), lJi (ua) = 
4 J2 (up). Therefore 


2=25 Jı(ugr), O0<a<l. 
k=1 


5.37 Using the results of Exercises 5.13 and 5.14(a), 
1 
1 
Da = | PAdo = = [uA ln) ~ we Jol) 
0 k 


1 
= — J(u). 
Hk 


Bessel’s equation also implies 


1 Ay? —1 
[J (us) = UF (24)? + gz (Mite = 1) Jf (2m) = 2 Ji (2m) 
Consequently, 
a =257 Hn Talo Jy ([0,,2), O<a<2. 
kai (44% — 1)J OT 


This representation is not pointwise. At x = 1, f(1) = 1 whereas the 
right-hand side is 5[f(1*) + f(1~)] = ż. 
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5.39 Assuming u(r, t) = v(r)w(t) leads to 


Solve these two equations and apply the boundary condition to obtain 
the desired representation for u. 


5.41 Use separation of variables to conclude that 


Co 


u(r, t) = X Jo(upr)[agcos ppct + bysin pgct), 
k=1 


2 R 
ak = mr, f(r) Jo(u,.7)rdr, 


2 R 
= —— r) J r)rdr. 
ea g(r) Jolu) 


Chapter 6 


6.1 (a) FE zE a(l —cos&). (c) O) = $ (1 = eW 8) 


6.3 For any fixed point € € J, let €, be a sequence in J which converges to 
€. Because 


IF) - F(E) < f lolz, £n) — ola, £)| da, 


and |y(z,€,) — y(x,€)| < 2g(x) € L! (I), we can apply Theorem 6.4 to 
the sequence of functions y(x) = y(x, En) — p(z, E) to conclude that 


Jim FE) -FOIS jim, f ole.g,) -oE 8de 


= f lim Jole, £n) — ole, )| dz =0. 


n—oo 


6.5 Suppose € € J, and let €,, — £. Define 


v(x, En) T plx, €) 
En zi E i 


then Y, (z,) —> pe(x,Ẹ) pointwise. Y, is integrable on J and, by the 
mean value theorem, Y„(x, £) = Yn (£, Nn) for some 7,, between €,, and £. 


Phala, E) = 
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6.8 
6.9 


6.10 


6.17 


Therefore |Y, (£, £)| < h(x) on I x J. Now use the dominated convergence 
theorem to conclude that f; ,,(2,€)dx > J, ye(x, €)dx. This proves 


FEn) — FE) 
ae, O 


The continuity of F” follows from Exercise 6.3. 
(a) 1, (b) 1/2, (c) 0. 
Express the integral over (a,b) as a sum of integrals over the subintervals 


(a, £1), ---, (£n, b). Because both f and g are smooth over each subinter- 
val, the formula for integration by parts applies to each integral in the 


sum. 
(a) f is even, hence B(€) = 0, A(€) = a sin x cos éx dx = e ds, 
2 [®© 1+ cos x€& : a 
and f(z) = a TEE TE dé. 
7 2 ~€ —sin€ ae 
O fla) = =f SEP sinag d, 
Define 


e`” cosx x>0 
Ga 
—e” cos £, x <0. 


Because f is odd its cosine transform is zero and 
2¢° 
EtA 
Now f(x) may be represented on (—o0, co) by the inversion formula (6.28), 


B(£) = 2 | e “cosxsin€x dz = 
0 


2 Co 3 
e *cosx = =| é sin xé dé. 
T Jo 


E +4 
Because f is not continuous at x = 0, this integral is not uniformly 
convergent. 
Extend 


1, O<E<T 
CET L>T 


as an odd function to R and show that its sine transform is B(€) = 
2(1 — cos 7&)/E€. 
Show that the cosine transform of f is 


1—cosé  sin?(€/2) 
Ae) = 2-9 = SE 
£ (€/2) 
Express f(x) as a cosine integral and evaluate the result at x = 0, which 
is a point of continuity of f. 
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6.19 


6.21 


6.27 
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AI 2 
Equation (6.31) implies that l| = JAJ? + IB]? = 27 (717. 


w,(x) decays exponentially as |x| —> 00, so it belongs to £L!(R) and % 
therefore exists. From Example 6.17 we have Yo (£) = v2rp(£). Assum- 
ing Pha (E) = (=i) V 2mp (E), we have 


hnilé) =F (er? *Hns1(2)) (©) 
= F [e**/?(20H, (0) — Hi (2))] (© 
= F [rpn (2) — val) (€) 
= ib, (8) — EDn lE) 
=i) t V 2| -p (6) + Ebn(6)] 
Verb 41(2), 


where we used the identity Hn4ı(£) = 2xHn(£) — H! (x) and Theorem 


6.15. Thus, by induction, w,,(€) = (—i)"V277,,(€) is true for all n € No. 


Define the integral I(z) = Pe e7? cos z£ d and show that it satisfies 
the differential equation I’(z) = —zI(z)/2b, whose solution is I(z) = 


1(0)e~*"/4, where I(0) = 4,/7/b. 


The boundary condition at x = 0 implies A(A) = 0 in the representation 
of u(x,t) given by (6.39), so that u is now an odd function of x. By 
extending f(a) as an odd function from (0,00) to (—00,00) we can see 
that B(A) is the sine transform of f and the same procedure followed in 
Example 6.18 leads to the desired result. 


The transformed wave equation ti(€,t) = —c?€?&(€,t) under the given 
initial conditions is solved by (£€,t) = f(€)cosc&ét. Taking the inverse 
Fourier transform yields the required representation of u. 


Chapter 7 
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7.2 (b) 2cosh 3z — ° sinh 3a. 
1 


(d) z (1 -= ert) . 
(£) 2./a/z. 
7.5 f(x) =2[H(x) — H(z — 1)] + e!7®H(x -— 1). 
1 1 1 
= 1 i B } Te 
ANO=F a-e- tete 
7.6 (c) H(a—3)+ H(x-1). 
7.7 If f has jump discontinuities at the points z1, .. . , &n then the sum f(z; )— 
f(at) +--+ f(a7z) — f(x}) has to be added to the right-hand side of 
(7.6). 


7.8 (e) y(x) = H(z -— 1) [zee — e”! 4 ;| —e* +e”. 
7.9 (c) : (ee), 
7.11 (a) Write 

UP = f Heede 


0 
is (n+1)p 
= Be f(aje "dx 


n=0" PP 
O. -PPR 
=X | Het npe Pdz, 
n=079 
then use the equation f(x + np) = f(x) to arrive at the answer. 


wen [ža e=’) =). 


Izen | 8? s 
7.13 The left-hand side is the convolution of x? and y(x). Applying Theorem 
7.14 gives 3!Y(s)/s+ = F(s), from which Y (s) = s*F(s)/6. From Corol- 
lary 7.7 we conclude that 


ya) = EF O(a) + LAOH + f(O) + FOF) 9 + f” (0). 


The integral expression for f(x) implies that f° (0+) = 0 for n = 0,1,2,3 
(we also know from Exercise 7.12 that s” cannot be the Laplace transform 
of a function in € for any n € No). Assuming that f is differentiable to 
fourth order (or that y is continuous), the solution is y(x) = f («)/6. 
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—s 


s(1 — e75) 
7.17 u(x,t) = H(t — x/c)cos?(t — z/c). 
7.19 u(a,t) = e~*/*H(t — x/c)sin(t — z/c). 


7.15 L((z})(s) = 


7.21 F(s) = e~*V*/,/s is analytic in the complex plane cut along the negative 
axis (—oo, 0]. Using Cauchy’s theorem, the integral along the vertical line 
(3 — ico, B + ico) can be reduced to two integrals, one along the bottom 
edge of the cut from left to right, and the other along the top edge from 
right to left. This yields 
1 B+ioo 
LTH(F)(£) = — F(s)e*"ds 


J eal ods 


= z e™®? cos at dt. 
T Jo 


Noting that the last integral is the Fourier transform of e 
ing the result of Example 6.17, we obtain the desired expression for 
LG ye): 


2 
=2t and us- 
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N = {1,2,3,. 


Notation 


st 


No = {0,1,2,3,...} 


= {...,—2, —1,0,1,2,...} 


rational numbers 

real numbers 

complex numbers 

R or C, 1 

set of continuous functions on the interval J, 3,5 

functions on J which have continuous derivatives up to order 
k, 5 

functions on I which have continuous derivatives of all orders, 5 
Lebesgue integrable functions on the interval J, 186 

Lebesgue square-integrable functions on Z, 15 

Lebesgue square-integrable functions on I with respect to the 
weight function p, 18 

locally integrable functions on [0, 00) of exponential growth at 
oo, 221 

inner product in £7, 6,14 

inner product in Le 18 

L? norm, 14 

L norm, 18 

the sequence of functions fn converges pointwise to f, 20-21 
fn converges uniformly to f, 22,23 


fn converges to f in £2, 31-32 
Fourier transform of f, 187 
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F=CL(f) Laplace transform of f, 223 
fxg convolution of f and g, 215 
fr positive and negative parts of the function f, 111 
L adjoint of the differential operator L, 55-56 
L* formal adjoint of L, 57 
A Laplacian operator, 119 
W(f,g) Wronskian of f and g, 45 
G(x, £) Green’s function, 68-69 
) Dirichlet’s kernel, 106 


Dna 
erf error function, 160 

Py, Legendre polynomial of order n, 133 

Qn Legendre function of order n, 134 

Ay, Hermite polynomial of order n, 141 

In Laguerre polynomial of order n, 146 

Jo Bessel function of the first kind of order v, 162 
Y, Bessel function of the second kind of order v, 169 
Ba Ky modified Bessel functions of order v, 171 

T gamma function, 157 

H Heaviside function, 222 


Notation 


analytic function 43 
Ascoli—Arzela theorem 75 


basis 4 

Bessel functions 

— generating function for 171 

— integral representation of 172 
— modified 171 

— of the first kind 160, 162 

— of the second kind 169 

— orthogonality of 174, 175, 177 
beta function 159 
Bolzano-Weierstrass theorem 247 
boundary conditions 44 

— homogeneous 44 

— periodic 44 

— separated 44 

boundary-value problem 44 


Cauchy principal value 198 
Cauchy sequence in £? 33 
Cauchy-Euler equation 43 
Completeness of £? 33 
Completeness theorem 83 
convergence of sequence 

— in £? 31, 32 

— pointwise 20 

— uniform 22 
convergence of series 

— absolute 25 

- in £? 33 

— pointwise 26 

uniform 26 


| 


convolution 215 


cylindrical coordinates 180 


differential equation 


| 


| 


homogeneous 41 
linear 42 
nonhomogeneous 41 
regular 42 

singular 42 


differential operator 


adjoint 56-57 

formal adjoint of 57 
formally self-adjoint 57 
self-adjoint 57 


Dirichlet integral 193 
Dirichlet kernel 106 

Dirichlet problem 180 
dominated convergence theorem 188 


eigenfunction 58 
eigenvalue 56 
eigenvector 56 
electric capacitor 152 
equation 


Bessel 160 
Cauchy—Euler 43 

heat 118-120, 122, 123 
Hermite 89, 143 
Laguerre 89, 146 
Laplace 148-153 
Legendre 89, 130 
Schrödinger 154 
telegraph 236 

wave 124-126 
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error function 160 
Euclidean space 9 
exponential order 222 


Fourier coefficients 97 

Fourier integral 198 

Fourier series 

— exponential form 100 

— fundamental theorem of 97 

— generalized form of 129 

— L? convergence of 93-102 

— pointwise convergence of 102-117 
— trigonometric type 95-97 

— uniform convergence of 112, 114 
Fourier—Bessel series 176 
Fourier—Legendre series 138 
Fourier transform 187 

— cosine transform 201 

— sine transform 201 


gamma function 157-159 
Gram-Schmidt method 12 
Green’s formula 66 
Green’s function 68 


harmonic oscillator 153 

Heaviside function 222 

Hermite polynomials 141, 143, 145, 147, 
148 

— generating formula for 141 

Hilbert space £? 34 

— completeness of 33 

— density theorem for 34 


inequality 

— Bessel’s 38 

— Cauchy—Bunyakovsky—Schwarz 
(CBS) 7 

— triangle 8 

initial conditions 42 

initial-value problem 42 

inner product 6 

inner product space 6 

inverse Fourier transform 198 

inverse Laplace transform 226 

isolated zero 49 


Lagrange identity 66 

Laguerre polynomials 145, 146 

Laplace transform 223 

Laplacian operator 120 

Lebesgue dominated convergence 
theorem 188 

Legendre function 134 

Legendre polynomials 133 

— generating function for 143 

— Roderigues’ formula for 136 
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Leibnitz’ rule 251 

linear independence 3 

linear operator 55 

— adjoint of 55 

— self-adjoint 55 

linear vector space 1 

locally integrable functions 215 
norm 

— of operator 76 

— of vector 7 


orthogonal vectors 11 
orthonormal vectors 11 


Parseval’s (completeness) relation 38 
piecewise continuous function 103 
piecewise smooth function 104 
Plancherel’s theorem 204 

potential function 123 

power series 30 

projection vector 12 


Riemann—Lebesgue lemma 189 


sequence of functions 20 

sine integral 241 

solution of differential equation 41 

— by separation of variables 117 

— oscillatory 53 

— zeros of 49 

spherical capacitor 152 

spherical coordinate 148 

Sturm comparison theorem 51 

Sturm separation theorem 50 

Sturm-—Liouville (SL) eigenvalue problem 
58, 67 

— completeness of eigenfunctions for 
79-88 

— existence of eigenfunctions for 68-79 

— regular 67 

— singular 68, 88 


translation theorem 232 
uniform convergence of integral 196 


vector space 1 

— complex 2 

— dimension of 4 
— real 2 

— subspace of 4 


weight function 18 

Weierstrass approximation theorem 81 
Weierstrass M-test 26 

Wronskian 45 


